Nucleotide Sequences for Transcriptional Regulation in Corynebacterium glutamicum

Background of the Invention 

Field of the Invention 

The invention relates to the areas of microbial genetics and recombinant
DNA technology. The invention provides DNA sequences, vectors,
microorganisms, and methods useful for inducing and regulating the expression 
of genes, including those that are involved in amino acid biosynthesis, in bacterial 
cells. 

Background of the Invention 

Coryneform bacteria are Gram-positive bacteria frequently used for 
industrial-scale production of amino acids, purines, and proteins. Although 
coryneform bacteria, particularly Corynebacterium species, have been widely used
for industrial purposes for many years, the techniques of molecular biology have 
only recently been employed to augment the usefulness of these organisms in the 
production of amino acids and other products. 

One way to improve the productivity of a microbial strain is to increase 
the expression of genes that control the production of a metabolite. Increasing 
expression of a gene can increase the activity of an enzyme that is encoded by that 
gene. Increasing enzyme activity can increase the rate of synthesis of the 
metabolic products made by the pathway to which that enzyme belongs. In some 
instances, increasing the rate of production of a metabolite can unbalance other 
cellular processes and inhibit growth of a microbial culture. The modified culture 
will make more product per cell, but will not be able to generate enough cells per 
volume to show an improvement over the parent strain in a fermentor. 
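
By way of illustration only, the trade-off described above can be expressed as a simple calculation in which the product obtained per unit volume is the product made per cell multiplied by the number of cells per unit volume. The following Python sketch rests on that simplifying assumption; the function name `volumetric_output` and all numerical values are hypothetical and are not taken from any experiment.

```python
def volumetric_output(per_cell_productivity: float, cell_density: float) -> float:
    """Product per unit volume = product per cell x cells per unit volume."""
    return per_cell_productivity * cell_density


# Hypothetical, unitless values chosen only to illustrate the trade-off.
# Parent strain: baseline productivity, normal growth.
parent = volumetric_output(per_cell_productivity=1.0, cell_density=100.0)

# Modified strain: 50% more product per cell, but impaired growth
# so that the culture reaches only 60% of the parent cell density.
modified = volumetric_output(per_cell_productivity=1.5, cell_density=60.0)

print(f"parent:   {parent}")
print(f"modified: {modified}")
```

In this hypothetical case the modified strain makes more product per cell yet less product per unit volume than the parent, which is the situation a growth-phase-regulated promoter is intended to avoid.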

Transcription is the process by which an RNA molecule is synthesized 
from a DNA template and occurs by the interaction of a multisubunit enzyme 
complex, known as RNA polymerase, with a DNA molecule. The RNA that is 
synthesized by this process ultimately directs the production of protein products
within the cell. In general, the rate at which RNA is synthesized from DNA, i.e.,
the transcription rate, directly influences the level of synthesis of the 
corresponding protein product. 

Promoters are DNA sequence elements that regulate the rate at which 
genes are transcribed. Promoters can influence transcription in a variety of ways. 
For example, some promoters direct the transcription of their associated genes at 
a constant rate regardless of the internal or external cellular conditions. Such
promoters are known as constitutive promoters. In many cases, however, a 
promoter will direct transcription of its associated gene only under very specific 
cellular conditions. For example, promoters that turn off gene expression during 
the growth phase of a microbial culture, but turn on gene expression after optimal 
growth has been achieved can be used to regulate genes that control production 
of a metabolite. The new strain will have the same growth pattern as the parent 
but produce more product per cell. This kind of modification can also improve 
titer (g product/liter) and yield (g product/g glucose). Nucleotide sequences have 
been identified that can be used to increase or decrease gene expression in 
Corynebacterium species. These regulatable promoters can increase or decrease 
the rate at which a gene is transcribed depending on the internal and/or the 
external cellular conditions. Frequently, the presence of a factor, known as an 
inducer, can stimulate the rate of transcription from a promoter. Inducers can 
interact directly with molecules that, themselves, physically interact with the 
promoter or with DNA sequences in the vicinity of the promoter. Alternatively, 
the action of an inducer in stimulating transcription from a promoter may be 
indirect. Whereas inducers function to amplify the level of transcription from a 
promoter, there is a class of factors, known as suppressors, that reduce or inhibit 
transcription from a promoter. Like inducers, suppressors can exert their effects 
either directly or indirectly. 

Besides regulation through inducers and suppressors, certain promoters 
are regulated by temperature. For instance, the level of transcription from a
promoter may be increased when cells harboring that promoter are grown at a
temperature that is greater than the optimum or normal growth temperature for 
that cell type. Similarly, there are promoters that will enhance gene expression 
in cells grown at temperatures below the normal growth temperature. 

Promoters are found naturally in wild-type cells where they regulate the 
expression of specific genes. Promoters, however, are also useful as tools of 
molecular biology in that they can be isolated from their normal cellular contexts 
and engineered to regulate the expression of virtually any gene. 

The use of regulatory sequences from Escherichia coli to control the
expression of reporter genes in Corynebacterium has been documented. The
lacIq repressor and the tac promoter/reporter genes from E. coli were on plasmids
that replicate in Corynebacterium. See, e.g., Morinaga, Y. et al., J. Biotechnol.
5:305-312 (1987). In addition, Ben-Samoun et al., FEMS Microbiology Letters
174:125 (1999), which is incorporated herein by reference in its entirety, disclose
the use of the E. coli araBAD promoter and the araC activator on a plasmid 
which replicates in Corynebacterium glutamicum cells to stimulate the expression 
of the GFPuv reporter gene only when L-arabinose is present in the growth 
medium. The authors acknowledge, however, that the level of expression from 
the araBAD promoter in C. glutamicum is 6.5 fold lower than that which was 
observed in E. coli, the native species for the promoter. 

U.S. Patent Nos. 5,693,781 and 5,762,299, each of which is incorporated
herein by reference in its entirety, disclose the isolation of promoter
sequences from the coryneform bacterium Brevibacterium flavum. Sequences
described in these patents were isolated on the basis of their ability to direct 
expression of a reporter gene in B. flavum at a level that was greater than the 
expression level observed in B. flavum with the synthetic tac promoter. Also 
disclosed in U.S. Patent Nos. 5,693,781 and 5,762,299, each of which is
incorporated herein by reference in its entirety, are B. flavum promoters
capable of expressing a reporter gene in B. flavum cells when grown in medium
containing: (a) ethanol but not glucose, and vice versa; (b) glucose but not
fructose; and (c) glucose but not casein hydrolysates/yeast extract/glucose, and
vice versa. The novel promoter sequences disclosed in the present application are 
different from those described in U.S. Patent Nos. 5,693,781 and 5,762,299. 

A limited number of C. glutamicum promoters have been described to
date. For example, the C. glutamicum aceA promoter is disclosed in Wendisch
et al., Arch. Microbiol. 168:262 (1997) and in U.S. Patent No. 5,700,661 (where
it is termed the isocitrate lyase promoter), each of which is incorporated herein
by reference in its entirety. In both of these references, the aceA promoter
was linked to a reporter gene in transformed C. glutamicum cells, and produced
an extracellular protein, not a product of metabolic engineering. Expression of
the reporter gene was found to be greater in C. glutamicum transformants that
were grown in the presence of acetate than it was for transformants grown in the
presence of glucose (Wendisch et al., Arch. Microbiol. 168:262 (1997)) or sucrose
(U.S. Patent No. 5,700,661).

Similarly, the aceB promoter from C. glutamicum is disclosed in
Wendisch et al., Arch. Microbiol. 168:262 (1997) and in U.S. Patent No.
5,965,391, which is incorporated herein by reference in its entirety. Both of these
references describe transcriptional fusions consisting of the aceB promoter region
linked to a reporter gene in C. glutamicum transformed cells. Expression of the
reporter gene was found to be greater in C. glutamicum transformants grown in
acetate-containing medium than it was for transformants grown in
glucose-containing medium (Wendisch et al., Arch. Microbiol. 168:262 (1997)) or
other carbon sources (U.S. Patent No. 5,965,391).

Reinscheid et al., Microbiology 145:503 (1999), which is incorporated
herein by reference in its entirety, disclose a transcriptional fusion between the
C. glutamicum pta-ack promoter and a reporter gene (chloramphenicol 
acetyltransferase). C. glutamicum cells harboring the transcriptional fusion 
demonstrated enhanced reporter gene expression when grown in acetate- 
containing medium as compared to transformed cells that were grown in glucose- 
containing medium. 

In Patek et al., Microbiology 142:1297 (1996), which is incorporated
herein by reference in its entirety, several DNA sequences from C. glutamicum,
identified on the basis of their ability to promote the expression of a 
chloramphenicol resistance reporter gene in C. glutamicum cells, are disclosed 
and compared to one another in an attempt to define a consensus sequence for C. 
glutamicum promoters. 

There is clearly a need for a broader assortment of well-defined 
Corynebacterium species promoters than has been heretofore described. Such
promoters would be useful in the constitutive and/or regulated expression of
genes in coryneform cells. For example, a collection of C. glutamicum
promoters, regulated by inexpensive carbon sources, would facilitate the
industrial-scale production of amino acids and purines in C. glutamicum cells by
enhancing the expression of genes that encode components of the biosynthetic
pathways for the desired amino acids or purines. Likewise, a versatile array of
coryneform promoters would be useful for the industrial-scale production of
heterologous polypeptides in C. glutamicum cells by stimulating the enhanced
expression of genes encoding such heterologous polypeptides. 

Summary of the Invention 

The present invention relates to isolated polynucleotides that function as 
transcriptional regulators, in particular, promoters, in Corynebacterium species 
host cells, preferably Corynebacterium glutamicum, Brevibacterium flavum, or 
Brevibacterium lactofermentum host cells, even more preferably C. glutamicum
host cells. These promoters are useful for regulating and enhancing the
production of a variety of products in such host cells. Examples of products, the 
production of which may be enhanced in Corynebacterium species host cells as 
a result of the present invention, are amino acids, such as lysine; purine 
nucleotides, such as inosinic acid; and heterologous polypeptides. Since 
Corynebacterium species are especially useful for the industrial-scale production 
of amino acids, purines, and polypeptides, use of the promoters of the present
invention may greatly improve the yields of these products from 
Corynebacterium species host cells. 

In one embodiment of the present invention, an isolated polynucleotide
is provided, comprising a first nucleic acid, the sequence of which is selected
from the group consisting of SEQ ID NOs: 4 through 22.

Further embodiments of the invention include an isolated polynucleotide
comprising a first nucleic acid, the sequence of which is at least 90% identical,
and more preferably at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%
identical, to a sequence selected from the group consisting of SEQ ID NOs: 4
through 22.
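
For illustration only, the percent identity thresholds recited above can be pictured as a position-by-position comparison of two sequences that have already been aligned to equal length. The following Python sketch is a simplified assumption and is not the identity calculation of the specification; the function names `percent_identity` and `meets_threshold` and the example sequences are hypothetical, and gap handling is deliberately minimal.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned positions at which both sequences carry the same base.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Positions where both sequences have a gap are not counted as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if the two aligned sequences are at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 20-nucleotide sequences differing at a single position.
reference = "ACGTACGTACACGTACGTAC"
candidate = "ACGTACGTACACGTACGTAA"

print(percent_identity(reference, candidate))
print(meets_threshold(reference, candidate, threshold=90.0))
```

In practice the alignment itself would be produced by a sequence comparison program; the sketch only shows how a threshold such as 90% would be applied once an alignment exists.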

Additional embodiments of the invention include an isolated
polynucleotide comprising a first nucleic acid, the sequence of which comprises
about 6 contiguous nucleotides, preferably about 10 contiguous nucleotides, even
more preferably about 12, 15, 20, 30, 40, 50, 100, 150, 200, or 500 contiguous
nucleotides, of a sequence selected from the group consisting of SEQ ID NOs: 4
through 22, wherein the first nucleic acid is capable of regulating transcription of
a second nucleic acid, preferably as part of a promoter.
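
As a non-limiting illustration, whether a candidate sequence contains a run of contiguous nucleotides taken from a reference sequence can be checked by comparing all substrings of the required length. The following Python sketch assumes exact matching on one strand only; the function name `shares_contiguous_run` and both example sequences are hypothetical and do not correspond to any SEQ ID NO of the specification.

```python
def shares_contiguous_run(candidate: str, reference: str, k: int) -> bool:
    """True if `candidate` contains at least k contiguous nucleotides of `reference`."""
    if k <= 0:
        raise ValueError("k must be a positive integer")
    candidate = candidate.upper()
    reference = reference.upper()
    # Collect every length-k substring of the reference.
    reference_kmers = {
        reference[i:i + k] for i in range(len(reference) - k + 1)
    }
    # Look for any length-k substring of the candidate among them.
    return any(
        candidate[i:i + k] in reference_kmers
        for i in range(len(candidate) - k + 1)
    )


# Hypothetical sequences used only to exercise the function.
reference_seq = "TTGACAGCTAGCTCAGTCCTAGGTATAAT"
candidate_seq = "GGGCTAGCTCAGTCCCC"

print(shares_contiguous_run(candidate_seq, reference_seq, 6))
print(shares_contiguous_run(candidate_seq, reference_seq, 12))
print(shares_contiguous_run(candidate_seq, reference_seq, 15))
```

The sketch tests sequence content only; whether such a fragment is capable of regulating transcription is a functional question that the comparison does not address.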

In one aspect of this embodiment, the promoter comprises any one or
more of the following genetic elements: a minus 10 ("-10") sequence; a minus 35
("-35") sequence; a transcription initiation site; an enhancer region; and an
operator region. Preferably the genetic elements are specific for Corynebacterium
species, more preferably are specific for Corynebacterium glutamicum,
Brevibacterium flavum, and Brevibacterium lactofermentum, and even more
preferably are specific for Corynebacterium glutamicum.

Additional embodiments of the invention include an isolated
polynucleotide comprising a first nucleic acid at least 10 nucleotides in length,
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, which hybridizes
under stringent conditions to a reference nucleic acid, or the complement thereof,
wherein the sequence of said reference nucleic acid is selected from the group
consisting of SEQ ID NOs: 4 through 22. The meaning of the phrase "stringent
conditions" as used herein is described infra.

WO 02/40679 PCT/US01/43096

In further embodiments of the present invention, an isolated 
polynucleotide is provided, comprising a first nucleic acid as described above; 
and a second nucleic acid. According to this embodiment, the first nucleic acid 
comprises a transcriptional regulatory region, preferably, a promoter. 

In one aspect of this embodiment, said second nucleic acid encodes one 
or more polypeptides. Preferably, the physical location of the first nucleic acid 
relative to the second nucleic acid is such that, under the appropriate conditions, 
the first nucleic acid regulates transcription of the second nucleic acid, thereby 
facilitating production of the polypeptide. 

In another aspect of this embodiment, the second nucleic acid encodes one 
or more components of a purine biosynthesis pathway. One example of a purine 
biosynthesis pathway included in this aspect of the invention is the enzymatic 
pathway that results in the synthesis of inosinic acid. 

In another aspect of this embodiment, the second nucleic acid encodes one 
or more heterologous polypeptides. The heterologous polypeptide may be one 
that is from a Corynebacterium species or one that is from a non- 
Corynebacterium species. 

In one particularly preferred aspect of this embodiment, an isolated 
polynucleotide is provided, comprising a first nucleic acid as described above; 
and a second nucleic acid, which encodes a component of an amino acid 
biosynthesis pathway. Examples of amino acid biosynthesis pathways included 
in this aspect of the invention are the enzymatic pathways that result in the 
synthesis of L-glycine, L-alanine, L-leucine, L-methionine, L-phenylalanine, L- 
tryptophan, L-lysine, L-glutamine, L-glutamic acid, L-serine, L-proline, L-valine, 
L-isoleucine, L-cysteine, L-tyrosine, L-histidine, L-arginine, L-asparagine, L- 
aspartic acid, and L-threonine. 

In a preferred aspect of this embodiment, the second nucleic acid encodes 
one of the following components of the L-lysine biosynthesis pathway: 
aspartokinase, aspartate beta-semialdehyde dehydrogenase, diaminopimelate
dehydrogenase, diaminopimelate decarboxylase, dihydrodipicolinate synthetase, 
dihydrodipicolinate reductase, or pyruvate carboxylase. 

Another embodiment of the invention provides an isolated polynucleotide 
comprising a first nucleic acid, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 2, and a 
second nucleic acid which encodes a polypeptide which functions as a component 
of the lysine biosynthesis pathway. In this embodiment, the first nucleic acid 
regulates transcription of the second nucleic acid. 

Yet another embodiment of the invention provides an isolated 
polynucleotide comprising a first nucleic acid, the sequence of which is at least 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID 
NO: 3; and a second nucleic acid which encodes a polypeptide which functions as
a component of an amino acid biosynthesis pathway, preferably a lysine 
biosynthesis pathway. In this embodiment, the first nucleic acid regulates 
transcription of the second nucleic acid. 

Further embodiments of the invention provide isolated Corynebacterium 
chromosomes with a first nucleic acid integrated into the chromosome, the 
sequence of which is one of the following: a sequence which is at least 90%, 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 1; 
a sequence which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%,
98%, or 99% identical to SEQ ID NO:23; a sequence which is at least 90%,
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID
NO:30; or a sequence which is identical to either SEQ ID NO:26 or SEQ ID
NO:27. In the embodiments where the integrated first nucleic acid has a 
sequence identical to either SEQ ID NO:26 or SEQ ID NO:27, the isolated 
chromosome also has a third nucleic acid integrated therein, the sequence of 
which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% 
identical to SEQ ID NO:28, operably linked to a transcription control region, 
wherein the nucleic acid encodes a polypeptide at least 90%, 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:29. In these
embodiments, the isolated Corynebacterium species chromosome also has a
second nucleic acid integrated therein, wherein said second nucleic acid encodes
a polypeptide which functions as a component of an amino acid biosynthesis
pathway, preferably, a lysine biosynthesis pathway. Additionally in these 
embodiments, the first nucleic acid regulates transcription of the second nucleic 
acid. 

In another aspect of the above embodiments, a Corynebacterium species 
host cell comprising a Corynebacterium species chromosome as described above 
is provided, preferably a host cell derived from Corynebacterium glutamicum,
Brevibacterium flavum, or Brevibacterium lactofermentum. In a related aspect,
a method of producing such a Corynebacterium species host cell is provided,
which comprises transforming Corynebacterium species cells with a vector
comprising a first nucleic acid as described above, wherein said vector facilitates
integration of the first nucleic acid into the chromosome of said Corynebacterium
species cells, and selecting the host cell. In certain preferred embodiments, the
vector also comprises the second nucleic acid described above, and/or the third 
nucleic acid described above, with the first nucleic acid being physically situated 
to regulate transcription of the second nucleic acid. These vectors are also 
provided. 

Examples of amino acid biosynthesis pathways which may be included 
in the above embodiments of the invention are the enzymatic pathways that result 
in the synthesis of L-glycine, L-alanine, L-leucine, L-methionine, L- 
phenylalanine, L-tryptophan, L-lysine, L-glutamine, L-glutamic acid, L-serine, L- 
proline, L-valine, L-isoleucine, L-cysteine, L-tyrosine, L-histidine, L-arginine, L- 
asparagine, L-aspartic acid, and L-threonine. 

In those embodiments where the second nucleic acid encodes a 
component of the L-lysine biosynthesis pathway, the second nucleic acid may 
encode one or more of the following enzymes: aspartokinase, aspartate beta- 
semialdehyde dehydrogenase, diaminopimelate dehydrogenase, diaminopimelate 
decarboxylase, dihydrodipicolinate synthetase, dihydrodipicolinate reductase, or
pyruvate carboxylase. 

In another embodiment of the present invention, a method is provided for 
producing a vector. More specifically, the method comprises inserting into a 
vector any of the isolated polynucleotides described herein. 

In another embodiment of the present invention, a vector is provided 
comprising any of the isolated polynucleotides described herein. 

In one aspect of this embodiment, the vector further comprises a multiple 
cloning region into which a heterologous second nucleic acid can be inserted, 
thereby allowing for the regulated and/or elevated expression of a variety of 
second nucleic acids therein. 

In another aspect of this embodiment, said vector is a shuttle vector. As 
used herein, the term "shuttle vector" refers to a vector that can replicate and be 
maintained in more than one host cell species. In a preferred aspect of this 
embodiment, said shuttle vector can replicate and be maintained in a 
Corynebacterium species host cell, preferably a host cell derived from 
Corynebacterium glutamicum, Brevibacterium flavum, or Brevibacterium
lactofermentum, and in an E. coli host cell.

In another embodiment of the present invention, a method of producing 
a transformed Corynebacterium species host cell is provided. The method of this 
embodiment comprises introducing into Corynebacterium species cells, 
preferably Corynebacterium glutamicum, Brevibacterium flavum, or 
Brevibacterium lactofermentum cells, a vector comprising any of the isolated 
polynucleotides described herein. 

In one aspect of this embodiment, the polynucleotide, after being 
introduced into the host cell, is integrated into the chromosome of the host cell. 
In another aspect of this embodiment, the polynucleotide, after being introduced 
into the host cell, is maintained as an extrachromosomal element; i.e., it does not
integrate into the chromosome of the host cell. 

In another embodiment of the invention, a method is provided for the
production of a biosynthetic product such as an amino acid, a purine nucleotide, 
or a heterologous polypeptide. According to this embodiment, a transformed 
Corynebacterium species host cell harboring a vector of the invention is used. 
More specifically, the vector comprises any isolated polynucleotide described 
herein. 

In a preferred aspect of this embodiment, the polynucleotide comprises a 
first nucleic acid which regulates transcription of a second nucleic acid, where the 
second nucleic acid encodes a polypeptide which functions in an amino acid 
biosynthesis pathway, as described above. In this aspect, the first nucleic acid 
regulates transcription of the second nucleic acid, thereby resulting in elevated 
production of the amino acid by the transformed host cell. 

In another aspect of this embodiment, the polynucleotide comprises a first 
nucleic acid which regulates transcription of a second nucleic acid, where the 
second nucleic acid encodes a component of a purine biosynthesis pathway. In 
this aspect, the first nucleic acid regulates transcription of the second nucleic acid, 
thereby resulting in elevated production of the purine by said transformed host 
cell. 

In yet another aspect of this embodiment, the polynucleotide comprises 
a first nucleic acid which regulates transcription of a second nucleic acid, where 
the second nucleic acid encodes a heterologous polypeptide. In this aspect, the 
first nucleic acid regulates transcription of the second nucleic acid, thereby 
resulting in elevated production of the heterologous polypeptide by said 
transformed host cell. 

In certain preferred aspects of this embodiment, the first nucleic acid 
comprises a promoter. Preferred promoters include, but are not limited to, the
following types of promoters: a constitutive promoter, an inducible promoter, a 
derepressable promoter, a heat sensitive promoter, and a cold sensitive promoter. 

Where the promoter is an inducible promoter, the method for the 
production of a biosynthetic product as above may further comprise adding an 
inducer to the culture medium. The inducer may be present throughout the
growth of the host cell, or alternatively, may be added to the culture medium after
the host cell has grown to an optimal density. In a preferred aspect, the host cell
is self-induced, i.e., the inducer is produced by said host cell at some point during
its growth cycle. When the inducer is present in the culture medium, the first
polynucleotide facilitates enhanced production of the biosynthetic product.
Inducers which may be used in the invention include, but are not limited to, acetic
acid, pyruvate, ethanol, starch subunits, sugars, cellulose subunits, fatty acids, and
triglycerides. Sugars that may be used as inducers in the present invention
include, but are not limited to, fructose, maltose, lactose, and arabinose.

Where the promoter is a derepressable promoter, the method for the 
production of a biosynthetic product as above may further comprise use of a 
culture medium which lacks a suppressor. In this aspect, the suppressor may be 
absent from the culture medium throughout the growth of said host cell, or the 
host cell may be initially cultured in a culture medium containing the suppressor, 
and be transferred to a culture medium lacking said suppressor when the host cell 
culture has reached an optimal density. In certain preferred aspects, the host cell 
may be self-regulating, i.e., the suppressor may be removed from said culture 
medium through metabolic depletion by the host cell. When the suppressor is 
removed or depleted from the culture medium, the first polynucleotide facilitates 
enhanced production of the biosynthetic product. Suppressors which may be used
in the invention include, but are not limited to, purines, pyrimidines, amino acids
such as histidine, and oxygen.

Where the promoter is a heat sensitive promoter or a cold sensitive 
promoter, the method for the production of a biosynthetic product as above may 
comprise adjustment of the temperature of the growth medium to a temperature 
which is greater than or less than the optimal growth temperature for an 
untransformed Corynebacterium species cell. For example, the yield of the 
biosynthetic product may be increased when said host cell is grown at a 
temperature adjusted to be greater than the optimal growth temperature for an 
untransformed Corynebacterium species cell, or the yield of the biosynthetic
product may be increased when the host cell is grown at a temperature adjusted 
to be less than the optimal growth temperature for an untransformed 
Corynebacterium species cell. 

Detailed Description of the Preferred Embodiments

The present invention relates to isolated polynucleotides that function as 
transcriptional regulators, in particular, promoters, in Corynebacterium species 
host cells, preferably Corynebacterium glutamicum, Brevibacterium flavum, or 
Brevibacterium lactofermentum host cells, even more preferably C. glutamicum
host cells. These promoters are useful for regulating and enhancing the
production of a variety of products in such host cells. Sources of promoters
include nucleotide sequences from the 5' end of native chromosomal genes from
Corynebacterium species, from sequences on plasmids that replicate in
Corynebacterium species, from sequences in the genome of phage that infect
Corynebacterium species, from sequences derived from other microorganisms,
e.g., Escherichia coli, or from sequences assembled by humans (tac, trc) which
are not found in nature. Genes of ribosomal proteins, ribosomal RNAs and
elongation factors show high levels of expression. The promoters of these genes
are candidates for increasing expression of amino acid biosynthetic pathway
genes.

Another reason for changing promoters of genes in biosynthetic pathways
is to make the pathway independent of factors that control the pathway in the
wild-type organism. For example, the native promoter of the operon that contains
diaminopimelate decarboxylase of the lysine biosynthetic pathway of C.
glutamicum can respond to arginine or lysine in the growth medium. Arginine
increased transcription three-fold and lysine decreased transcription by one third
(Oguiza, et al., J. Bact. 175:7356-7362 (1993)). Diaminopimelate decarboxylase
activity decreased 60% in cells grown in minimal medium supplemented with
10 mM lysine (Cremer, et al., J. Gen. Microbiol. 134:3221-3229 (1988)).
Replacing the promoter of lysA, which encodes the diaminopimelate
decarboxylase, is one way to make lysine biosynthesis independent of arginine and
lysine levels in media.

Gene expression from many of the transcriptional regulators of the 
present invention can be easily and inexpensively regulated by manipulating the 
composition and/or temperature of the medium in which cells containing these 
promoters are cultured. Regulated expression from a promoter is especially 
useful for controlling the production of a gene product that is toxic to cells at high 
levels. To allow for maximum production of a gene product, a culture of cells 
possessing the gene of interest under the control of an inducible promoter can be 
first grown to a sufficient cell density under conditions in which expression from 
the promoter is suppressed. Then, by manipulating the culture medium, 
expression from the promoter is induced. This strategy assures not only a high 
level of gene expression in the individual cells, but also a sufficiently high 
number of cells in the culture, so that maximum production of the gene product 
by the entire cell population is achieved. 
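
Purely as an illustration of the strategy described above, the benefit of delaying induction can be shown with a toy discrete-time model. The following Python sketch assumes that biomass doubles at each step while expression is suppressed, that growth stops once expression is induced, and that each unit of biomass then makes one unit of product per step. These assumptions, the function name `simulate_culture`, and all parameter values are hypothetical and are not measurements from the specification.

```python
def simulate_culture(
    total_steps: int,
    induce_at: int,
    growth_factor_suppressed: float = 2.0,
    growth_factor_induced: float = 1.0,
    product_per_biomass_step: float = 1.0,
    start_biomass: float = 1.0,
) -> tuple[float, float]:
    """Return (final biomass, total product) for a culture induced at step `induce_at`."""
    biomass = start_biomass
    product = 0.0
    for step in range(total_steps):
        if step >= induce_at:
            # Induced phase: product accumulates, growth is arrested.
            product += biomass * product_per_biomass_step
            biomass *= growth_factor_induced
        else:
            # Suppressed phase: no product, cells multiply.
            biomass *= growth_factor_suppressed
    return biomass, product


# Expression switched on from the start of the culture.
always_on = simulate_culture(total_steps=10, induce_at=0)

# Expression suppressed for six steps of growth, then induced.
delayed = simulate_culture(total_steps=10, induce_at=6)

print("induced from start:", always_on)
print("induced after growth:", delayed)
```

Under these assumed parameters the delayed culture accumulates far more product because many more cells are present when expression begins, which is the qualitative point made in the paragraph above.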

A. Definitions

In order to provide a clear and consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. It is also to be noted that the term "a" or "an"
entity refers to one or more of that entity; for example, "a polynucleotide," is
understood to represent one or more polynucleotides. As such, the terms "a" (or 
"an"), "one or more," and "at least one" can be used interchangeably herein. 

Auxotroph. As used herein, the term refers to a strain of microorganism 
requiring for growth an external source of a specific metabolite that cannot be 
synthesized because of an acquired genetic defect. 

Amino Acid Supplement. As used herein, the term refers to an amino
acid required for growth and added to minimal media to support auxotroph 
growth. 

Chromosomal Integration. As used herein, the term refers to the 
insertion of an exogenous DNA fragment into the chromosome of a host 
organism; more particularly, the term is used to refer to homologous 
recombination between an exogenous DNA fragment and the appropriate region 
of the host cell chromosome. 

Corynebacterium species. As will be understood by those skilled in the 
art, the terms "Corynebacterium" or "Corynebacterium species" include those
organisms previously identified in the literature as "Brevibacterium species," for
example Brevibacterium flavum and Brevibacterium lactofermentum, which have
now been reclassified into the genus Corynebacterium (Int. J. Syst. Bacteriol. 41:
255 (1981)). Accordingly, the term "Corynebacterium species" is used herein
interchangeably with "Corynebacterium species and Brevibacterium species."

Enhancers. As used herein, the term refers to a DNA sequence which can 
stimulate promoter activity and may be an endogenous element or a heterologous 
element inserted to enhance the level, i.e., strength of a promoter. 

Inducer. As used herein, the term "inducer" refers to a molecule which
acts to stimulate transcription from an inducible promoter. The inducer may be 
produced by a host cell, or added to a culture medium in which the host cell is 
being grown. 

Isolated Polynucleotide. As used herein, the term is intended to mean a 
polynucleotide, DNA or RNA, which has been removed from its native 
environment. For example, recombinant DNA molecules contained in a vector 
are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules 
maintained in heterologous host cells or purified (partially or substantially) DNA 
molecules in solution. However, a nucleic acid molecule contained in a clone 
that is a member of a mixed clone library (e.g., a genomic or cDNA library) and 
that has not been isolated from other clones of the library (e.g., in the form of a 
homogeneous solution containing the clone without other members of the library) 
or a chromosome isolated or removed from a cell or a cell lysate, is not "isolated" 
for the purposes of this invention. Isolated RNA molecules include in vivo or in 
vitro RNA transcripts of the DNA molecules of the present invention. Isolated 
nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 

Lysine Biosynthetic Pathway Protein. As used herein, the term "lysine 
biosynthetic pathway protein" is meant to include those peptides, polypeptides 
or proteins, and enzymes, which are directly involved in the synthesis of lysine 
from aspartate. These proteins may be identical to those which naturally occur 
within a host cell and are involved in the synthesis of lysine within that host cell. 
Alternatively, there may be modifications or mutations of such proteins, for 
example, the proteins may contain modifications or mutations which do not 
significantly affect the biological activity of the protein. For example, the natural 
protein may be modified by mutagenesis or by introducing or substituting one or 
more amino acids, preferably by conservative amino acid substitution, or by 
removing nonessential regions of the protein. Such modifications are readily 
performed by standard techniques. Alternatively, lysine biosynthetic proteins 
may be heterologous to the particular host cell. Such proteins may be from any 
organism having genes encoding proteins having the same, or similar, 
biosynthetic roles. 

Mutagenesis. As used herein, the term refers to a process whereby a 
mutation is generated in DNA. With "random" mutagenesis, the exact site of 
mutation is not predictable, occurring anywhere in the genome of the 
microorganism, and the mutation is brought about as a result of physical damage 
caused by agents such as radiation or chemical treatment. rDNA mutagenesis is 
directed to a cloned DNA of interest, and it may be random or site-directed. 

Mutation. As used herein, the term refers to one or more base pair 
changes, insertions or deletions in the nucleotide sequence of interest. 

Operably Associated. As used herein, the term "operably associated" 
refers to an association of nucleic acid elements in a functional relationship. A 
nucleic acid is "operably associated" when it is placed into a functional 
relationship with another nucleic acid sequence. For instance, a promoter or 
enhancer is operably associated with a polypeptide coding region if it affects the 
transcription of the polypeptide coding region. Operably associated means that 
the nucleic acids being associated are typically close together or contiguous and, 
where necessary, join two polypeptide coding regions, contiguous and in reading 
frame. However, since enhancers generally function when separated from the 
promoter by several kilobases, some polynucleotide elements may be operably 
associated but not close together or contiguous. 

Operon. As used herein, the term refers to a contiguous portion of a 
transcriptional complex in which two or more open reading frames encoding 
polypeptides are transcribed as a multi-cistronic messenger RNA, controlled by 
a cis-acting promoter and other cis-acting sequences necessary for efficient 
transcription, as well as additional cis-acting sequences important for efficient 
transcription and translation (e.g., mRNA stability controlling regions and 
transcription termination regions). 

Parental Strain. As used herein, the term refers to a strain of host cell 
subjected to some form of treatment to yield the host cell of the invention. 

Phenotype. As used herein, the term refers to observable physical 
characteristics dependent upon the genetic constitution of a host cell. 

Promoter. As used herein, the term "promoter" has its art-recognized 
meaning, denoting a portion of a gene containing DNA sequences that provide for 
the binding of RNA polymerase and initiation of transcription and thus refers to 
a DNA sequence capable of controlling the expression of a coding sequence or 
functional RNA. Promoter sequences are commonly, but not always, found in the 
5' non-coding regions of genes, upstream of one or more open reading frames 
encoding polypeptides. Sequence elements within promoters that function in the 
initiation of transcription are often characterized by consensus nucleotide 
sequences. The promoter sequence includes proximal and more distal upstream 
elements. Examples of proximal elements in bacterial promoters include a -10 
region and a -35 region, which are discussed in more detail, infra. Examples of 
more distal elements include operator regions and enhancer regions. As used 
herein, the term "endogenous promoter" refers to a promoter sequence which is 
a naturally occurring promoter sequence in that host microorganism. The term 
"heterologous promoter" refers to a promoter sequence which is a non-naturally 
occurring promoter sequence in that host microorganism. The non-naturally 
occurring promoter sequence may be from any prokaryotic or eukaryotic 
organism. A synthetic promoter is a nucleotide sequence having promoter 
activity that is not found in nature. 
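
By way of illustration only, the following Python sketch shows one way in which candidate -35 and -10 regions might be located in a nucleotide sequence. The consensus hexamers TTGACA and TATAAT, the spacing of 16 to 18 nucleotides, and the mismatch tolerance are assumptions borrowed from the well-characterized Escherichia coli sigma-70 promoter; they are not taken from the sequences described herein, and promoters of Corynebacterium species may deviate from them. The function names are hypothetical.

```python
def hamming(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_promoter_candidates(seq, minus35="TTGACA", minus10="TATAAT",
                             spacers=(16, 17, 18), max_mismatches=2):
    """Return (pos35, box35, pos10, box10) tuples for candidate promoters.

    The consensus hexamers and spacer lengths are assumed E. coli values,
    used here only as placeholders. Positions are 0-based offsets in `seq`.
    The mismatch limit is applied to each hexamer separately.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 6 + 1):
        box35 = seq[i:i + 6]
        if hamming(box35, minus35) > max_mismatches:
            continue
        for gap in spacers:
            j = i + 6 + gap
            box10 = seq[j:j + 6]
            if len(box10) == 6 and hamming(box10, minus10) <= max_mismatches:
                hits.append((i, box35, j, box10))
    return hits


# Synthetic test sequence (not one of the sequences described herein).
demo = "GGG" + "TTGACA" + "C" * 17 + "TATAAT" + "GGG"
exact_hits = find_promoter_candidates(demo, max_mismatches=0)
```

Such a scan only nominates candidates; as noted above, the boundaries of regulatory sequences are generally established experimentally.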

Promoters may be derived in their entirety from a native gene, or be 
hybrid promoters. Hybrid promoters are composed of different elements derived 
from different promoters found in nature, or even comprise synthetic DNA 
segments. Hybrid promoters may be constitutive, inducible or environmentally 
responsive. 

Useful promoters include constitutive and inducible promoters. Many 
such promoter sequences are known in the art. See, for example, U.S. Pat. Nos. 
4,980,285; 5,631,150; 5,707,828; 5,759,828; 5,888,783; 5,919,670; and 
Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
Harbor Press (1989). Other useful promoters include promoters which are neither 
constitutive nor responsive to a specific (or known) inducer molecule. Such 
promoters may include those that respond to developmental cues (such as growth 
phase of the culture), or environmental cues (such as pH, osmoticum, heat, 
dissolved gases, or cell density). 

Examples of environmental conditions that may affect transcription by 
inducible promoters include anaerobic conditions, elevated temperature, reduced 
temperature, or the presence of light. It is understood by those skilled in the art 
that different promoters may direct the expression of a gene in different cell types, 
or in response to different environmental conditions. Promoters which cause a 
gene to be expressed in most cell types at most times are commonly referred to 
as "constitutive promoters." It is further recognized that since in most cases the 
exact boundaries of regulatory sequences have not been completely defined, DNA 
fragments of different lengths may have identical or similar promoter activity. 

Relative Growth. As used herein, the term refers to a measurement 
providing an assessment of growth by directly comparing growth of a parental 
strain with that of a progeny strain over a defined time period and with a defined 
medium. 

Stringent Hybridization Conditions. As used herein, the term "stringent 
hybridization conditions" is intended to mean overnight incubation at 42 °C in a 
solution comprising: 50% formamide, 5x SSC (750 mM NaCl, 75 mM trisodium 
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by washing 
the filters in 0.1x SSC at about 65 °C. 
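
For convenience, the SSC strengths recited above can be converted to component concentrations. The following Python sketch is illustrative only; it derives the 1x composition from the 5x figures given in the definition (750 mM NaCl, 75 mM trisodium citrate), and the function and variable names are hypothetical.

```python
# Composition of 1x SSC in millimolar, derived from the 5x figures quoted
# in the definition above (5x SSC = 750 mM NaCl, 75 mM trisodium citrate).
ONE_X_SSC_MM = {"NaCl": 750 / 5, "trisodium citrate": 75 / 5}


def ssc_concentrations(fold):
    """Return the millimolar concentration of each SSC component at `fold`x."""
    return {name: round(mm * fold, 6) for name, mm in ONE_X_SSC_MM.items()}


hybridization = ssc_concentrations(5)   # 5x SSC of the overnight incubation
wash = ssc_concentrations(0.1)          # 0.1x SSC of the wash at about 65 °C

for label, mix in (("5x", hybridization), ("0.1x", wash)):
    print(label, mix)
```

On these figures the wash solution contains roughly one fiftieth of the salt present during hybridization, which is what makes the wash the more stringent step.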

Suppressor or Repressor. As used herein, the terms "suppressor" or 
"repressor" refer to molecules which act to block or reduce transcription from a 
derepressable promoter. The suppressor or repressor may be produced by a host 
cell, or added to a culture medium in which the host cell is being grown. 
Furthermore, during the growth of a host cell, the suppressor substance can be 
metabolized by the host cell, thereby removing it from the culture medium and 
increasing transcription from the derepressable promoter. 

Transcription factor. As used herein, the term "transcription factor" 
refers to RNA polymerases, and other proteins that interact with DNA in a 
sequence-specific manner and exert transcriptional regulatory effects. 
Transcription factors may be transcription inhibitory proteins or transcription 
activator proteins. In the context of the present invention, binding sites for 
transcription factors (or transcription complexes) are often included in the 
transcriptional regulatory element(s). 

Transcription factor recognition site. As used herein, a "transcription 
factor recognition site" and a "transcription factor binding site" refer to a 
polynucleotide sequence(s) or sequence motif(s) which are identified as being 
sites for the sequence-specific interaction of one or more transcription factors, 
frequently taking the form of direct protein-DNA binding. Transcription factor 
recognition sites which bind transcription inhibitory proteins or transcription 
activator proteins are also referred to as "operator regions." Typically, 
transcription factor binding sites can be identified by DNA footprinting, gel 
mobility shift assays, and the like, and/or can be predicted on the basis of known 
consensus sequence motifs, or by other methods known to those of skill in the art. 

Transcriptional Complex. As used herein, the term "transcriptional 
unit" or "transcriptional complex" refers to a polynucleotide that comprises one 
or more coding regions for polypeptides, a cis-acting linked promoter and other 
cis-acting sequences necessary for efficient transcription of the structural 
sequences, distal regulatory elements necessary for appropriate transcription of 
the structural sequences, and additional cis sequences important for efficient 
transcription and translation (e.g., mRNA stability controlling regions and 
transcription termination regions). In bacteria, a transcriptional complex may 
comprise a single coding region, or one or more coding regions, e.g., as part of 
an operon. 

Transcriptional Regulatory Element. As used herein, the term 
"transcriptional regulatory element" refers to a DNA region which activates 
transcription alone or in combination with one or more other DNA regions. A 
transcriptional regulatory element can, for example, comprise a promoter, 
response element, negative regulatory element, silencer element, gene 
suppressor, transcription terminator, and/or enhancer. 

B. Microbiological and Recombinant DNA Methodologies 

The invention as provided herein utilizes some methods and techniques 
that are known to those skilled in the arts of microbiology and recombinant DNA 
technologies. Methods and techniques for the growth of bacterial cells, the 
introduction of isolated DNA molecules into host cells, and the isolation, cloning 
and sequencing of isolated nucleic acid molecules, etc., are a few examples of 
such methods and techniques. These methods and techniques are described in 
many standard laboratory manuals, such as Davis et al., Basic Methods In 
Molecular Biology (1986); J.H. Miller, Experiments in Molecular Genetics, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1972); J.H. 
Miller, A Short Course in Bacterial Genetics, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York (1992); M. Singer and P. Berg, Genes & 
Genomes, University Science Books, Mill Valley, California (1991); J. 
Sambrook, E.F. Fritsch and T. Maniatis, Molecular Cloning: A Laboratory 
Manual, 2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1989); P.B. Kaufman et al., Handbook of Molecular and Cellular Methods 
in Biology and Medicine, CRC Press, Boca Raton, Florida (1995); Methods in 
Plant Molecular Biology and Biotechnology, B.R. Glick and J.E. Thompson, eds., 
CRC Press, Boca Raton, Florida (1993); and P.F. Smith-Keary, Molecular 
Genetics of Escherichia coli, The Guilford Press, New York, NY (1989), all of 
which are incorporated herein by reference in their entireties. 

Unless otherwise indicated, all nucleotide sequences newly described 
herein were determined using an automated DNA sequencer (such as the Model 
373 from Applied Biosystems, Inc.). Therefore, as is known in the art, for any 
DNA sequence determined by this automated approach, any nucleotide sequence 
determined herein may contain some errors. Nucleotide sequences determined 
by automation are typically at least about 90% identical, more typically at least 
about 95% to at least about 99.9% identical to the actual nucleotide sequence of 
the sequenced DNA molecule. The actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods well 
known in the art. 
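
To give a sense of scale, the identity figures above can be translated into a maximum number of differing positions. The following Python sketch is illustrative only; the 500-nucleotide length is a hypothetical value chosen for the example, and the function name is likewise hypothetical.

```python
def max_discrepancies(length_nt, percent_identical):
    """Upper bound on positions that may differ from the actual sequence."""
    return length_nt * (100.0 - percent_identical) / 100.0


READ_LENGTH = 500  # hypothetical length, chosen only for illustration

for pct in (90.0, 95.0, 99.9):
    bound = max_discrepancies(READ_LENGTH, pct)
    print(f"{pct}% identical: up to {bound:.1f} differing positions")
```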

C. Polynucleotides 

Certain embodiments of the present invention are directed to an isolated 
polynucleotide comprising a first nucleic acid, the sequence of which is related 
to, or identical to, a nucleotide sequence, or fragment thereof, selected from the 
group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, 
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID 
NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ 
ID NO:24, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, and SEQ ID NO:30. 
SEQ ID NOs 1-3 have been previously described. See, e.g., Reinscheid et al., 
Microbiology 145:503 (1999); U.S. Patent No. 5,700,661; and U.S. Patent No. 
5,965,391. SEQ ID NOs 23, 24, 26, 27, 28, 30 and 33-35 have been previously 
described. See, e.g., Ben-Samoun et al., FEMS Microbiology Letters 174:125-130 
(1999); Brosius, J. et al., J. Biol. Chem. 260:3539-3541 (1985); Amann, 
E., et al., Gene 69:301-315 (1988); Moeckel et al., J. Bacteriol. 174:8065-8072 
(1992); Keilhauer et al., J. Bacteriol. 175:5595-5603 (1993); and Patek et al., Appl. 
Env. Microbiol. 60:133-140 (1994). Isolation of polynucleotides comprising 
nucleic acids the sequences of which are identical to SEQ ID NOs 4-22 is 
described herein. While not being bound by theory, it is believed that 
polynucleotides having these sequences can be used to regulate gene expression 
in Corynebacterium species. The genes regulated by SEQ ID NOs 4-22, 
determined by similarity of the genes to genes identified in other organisms, are 
listed in Table 1A, along with putative exogenous regulatory molecules. SEQ ID 
NOs 1-22 are listed in Table 1B. 

TABLE 1A 

Nucleotide sequences that can be used to regulate gene expression 

SEQ ID NO:    Gene*    Regulatory Molecule** 
1             pta      acetate 
2             aceA     acetate 
3             aceB     acetate 
4             adh      ethanol 
5             aldB     ethanol 
6             poxB     pyruvate 
7             ldh      pyruvate 
8             amyE     carbon 
9             malZ     carbon 
10            bglX     carbon 
11            gam      carbon 
12                     carbon 
13            hisD     histidine 
14            pyrR     pyrimidine 
15            purD     purine 
16            hrcA     temperature 
17            htpX     temperature 
18            dnaK     temperature 
19            etc      temperature 
20            grpE     temperature 
21            clpB     temperature 
22            narA     oxygen 

SEQ ID NOs 1, 2, and 3 have been previously described. The remaining 
sequences were discovered in ADM's Corynebacterium glutamicum genome sequencing 
project. 

* Putative genes regulated by SEQ ID NOs 4-22 were determined by homology 
to genes identified in other organisms, e.g., Escherichia coli or Bacillus subtilis. 
** Putative regulatory molecules associated with the regulatory regions of SEQ ID NOs 
4-22 were determined by analogy to regulatory regions identified in other organisms. 

TABLE 1B 

Nucleotide sequences that can be used to regulate gene expression 



SEQ ID NO:1 

CACCGACAACGGCAACACGCAAAGGGCGAGACATATAAAGTTCGATTCCTTAAAGGGGTTCTAAAAAATG 
TGGAGTATGTGAGCGGGGTTCCACTAGTAGATTCGACTCCTATCGGGGTGCGACTGCTAATGGTGCCCTG 
CTATCAACCCTCCATGATACGTGGTAAGTGCAGACTAATAAAGGCCAGTCGGGGAGGATTGGGGGCTTTG 
CTGGGGGCAGATTTGTCACGCTGCGCGCTTTCATAGACCCCATTAATGTGGGGTGAAGAGCTGTAAAGTA 
CCGCTAAAAACTTTGCAAAGGGTGCTTCGCAACTTGTAACCGCTCCGTATTGTTTTCTACGGCAATAAGC 
ATTTGTGCTGCTCAAAGCGTGGAATTGAGATCGGTTTGAAAATTACAAAATAAAACTTTGCAAACCGGGC 
TGTACGCAAGGCGGACGAACGCTAAACTATGTAAGAAATCAC7UVCTTCCCCTCAGTAGTGCCAGGAGGCA 
CAAGCCTGAA 



SEQ ID NO:2 

ACTCTTTTAAGAAAAGCACTCTGACTACCTCTGGAATCTAGGTGCCACTCTTCTTTCGATTTCAACCCTT 
ATCGTGTTTGGCGATGTGATCAGACTAAGTGATCACCGTCACCAGCAAAAGGGGTTTGCGAACTTTACTA 
AGTCATTACCCCCGCCTAACCCCGACTTTTATCTAGGTCACACCTTCAAAACCTACGGAACGTTGCGGTG 
CCTGCATTTTCCCATTTCAGAGCATTTGCCCAGTACATCTGTACTAGCAACTCCCCCGCCCACTTTTTCT 
GCGAAGCCAGAACTTTGCAAACTTCACAACAGGGGTGACCACCCCCGCACAAAACTTAAAAACCCAAACC 
GATTGACGCACCAATGCCCGATGGAGCAATGTGTGAACCACGCCACCACGCAAACCGATGCACATCACGT 
CGAAACAGTGACAGTGCATTAGCTCATACTTTGTGGTTGGCACCGCCCATTGCGAATCAGCACTTAAGGA 
AGTGACTTTG 



SEQ ID NO:3 

TTGCGTGGTGGCGTGGTTCACACATTGCTCCATCGGGCATTGGTGCGTCAATCGGTTTGGGTTTTTAAGT 
TTTGTGCGGGGGTGGTCACCCCTGTTGTGAAGTTTGCAAAGTTCTGGCTTCGCAGAAAAAGTGGGCGGGG 
GAGTTGCTAGTACAGATGTACTGGGCAAATGCTCTGAAATGGGAAAATGCAGGCACCGCAACGTTCCGTA 
GGTTTTGAAGGTGTGACCTAGATAAAAGTCGGGGTTAGGCGGGGGTAATGACTTAGTAAAGTTCGCAAAC 
CCCTTTTGCTGGTGACGGTGATCACTTAGTCTGATCACATCGCCAAACACGATAAGGGTTGAAATCGAAA 
GAAGAGTGGCACCTAGATTCCAGAGGTAGTCAGAGTGCTTTTCTTAAAAGAGTTTTCACAACCGTTAACG 
GCGTAGCCAAACAAGAAGGATTCGCATTCTTCTGGTTTAGGCACAGGTCATCTAAACCCATGCTTTAAAA 
GGAGCCTTCA 



SEQ ID NO:4 

ATTGGAAAATGGAGATTAGAAGAATCCTGGGAATGTTGGTGTGTGTTGCATGTCTGTTGAGACTATCTAG 
TAGATGCGGCTGCGCTCCTTAATTGCATGCTGGGGTGGTGGGGAATGGGTGGTTGGGGGCGTCGAAAAGC 
ATTTTTGGTGCTTCTAAGGGAATTGTGTGAATCTTGGAAAGCTAATTGAAAAACATTCCCATTAGTGGGT 
GATTTGCTGGAGTTTTGTGAATCTATTTTTCGAAATTTCiAGCGTGCGGGGGTGGTTTGTTTTTTTACAAT 
TGCCAGTTCATTCACGGTTGTTGAAATGTTCGGGGGTAATAACTCAACTTTCTATTTTCACCTTGTTGGG 
ATTTCGCTAGGGTGGACGATGGCAGCAATTGAATGTTGTAAATCACACAATTGCAAGGATTGTAATTTAA 
GGCACATCTATGTCGGTGTGAAATTACATGTGCCAGAAGAGCAATTTGCCAAGTAATCCAAGCGAGAAGG 
AGTGAGTTTT 



SEQ ID NO:5 

CGATCATCGAACTCGGCGAAGAAAACCGAAACCTCAAAGAATCCCTGCGTAAGGTCACAGCTGAGAATGA 
GCAGCTCAAAGATCAATTACGCAGCGGGCGTCCGCGTGGCGAGCTGGTGCACGTGCCCCGCTCCACCGCG 
GTGGTCATGTGGGAACGCCGCAAGGGGCGTTCCAAGTAAAAACATGCTTGTCGACGCCGCTTTCTAGCAA 
ATTAAGCGGGCACCTCCATTTATCTTTTGGAGGTGCCCGTTTCGTGCTTTCGCCAATTAGATACATGCAT 
AACCACCCGAACAGGGGTAATAACTTTTGAAAGGCTTTCGGCGTTGAGCTGCGAGAATTTTGAGAAAAGG 
GGGTGAATTTAACAGGGGTTCTAGCGCGGATTGATTTTCGTGAATATGGTGGCTGCTAAGCGTGCGAATG 
TGCGCGTTATCACAATCGTTGACCAAGTGTCACCTGACGCACAGGTAGTGCTCAGGTGGAGGTGGCCCAA 
AGGAGACCCA 



SEQ ID NO:6 

TTTTAGACCACGGCGCTGTGTGGGGATTTAAGACGTCGGAAATTGTAGGGGACTGTCAGTGTGAGTCGGG 
TTCTTTGAGGCGCTTAGAGGCGATTCTGTGAGGTCACTTTTTGTGGGGTCGGGGTCTAAATTTGGCCAGT 
TTTCGAGGCGACCAGACAGGCGTGACAAGATTGACTAAAAAACCGAAGTTTTGGCACGTGTGTTTGGTTT 
CTCGGGGTCTAAACCGGACAGGCGTGACAAGATCTGGCGAAATCGCAGGTTTTTGTCACGCCTGTCTGGT 
TTTACCTTTTGGGGGCCCGAACTGCCCTGAACTACTCGGATCGACCAAGCAGTTTGGCCTCCAGCGCTCT 
GATCAAGCACCCAACCGCCTCTAAATCACACCAAGGCACTCGTAAAACCCGTGGCAGATAGAGAAAGTGT 
GGCAGCAACTCGAATTGAAGAGCACAATTGAAGTCGCACCAAGTTAGGCAACACAATAGCCATAACGTTG 
AGGAGTTCAG 



SEQ ID NO:7 

AAAACAGCCAGGTTAGCGGCTGTAACCCACCACGGTTTCGGCAACAATGACGGCGAGAGAGCCCACCACA 
TTGCGATTTCCGCTCCGATAAAGCCAGCGCCCATATTTGCAGGGAGGATTCGCCTGCGGTTTGGCGACAT 
TCGGATCCCCGGAACCAGCTCTGCAATGACCTGCGCGCCGAGGGAAGCGAGGTGGGTGGCAGGTTTTAGT 
GCGGGTTTAAGCGTTGCCAGGCGAGTGGTGAGCAAAGACGCTAGTCTGGGGAGCGAAACCATATTGAGTC 
ATCTTGGCAGAGCATGCACAATTCTGCAGGGCATAGATTGGTTTTGCTCGATTTACAATGTGATTTTTTC 
AACAAAAATAACACTTGGTCTGACCACATTTTCGGACATAATCGGGCATAATTAAAGGTGTAACAAAGGA 
ATCCGGGCACAAGCTCTTGCTGATTTTCTGAGCTGCTTTGTGGGTTGTCCGGTTAGGGAAATCAGGAAGT 
GGGATCGAAA 



SEQ ID NO:8 

CCAGATCTCGGGATGCTTCGTAGATCAGCATGGCGCCGAGGTCGTTGGCTGCTGCACGGAAAGCTGCGTT 
GTCGCTGCGCTCGTCGCGCAACAGGGGTAGGCGGCTAGCAACGAGTGGGTGGTTGACGATGGTGATGTCC 
ATAGCTTCACATGTTAAATCATTGCCGCCCAGAAGAAGACCGCGCGGGCGAATTTGGGCTTGGAGGGAAC 
CAAACGGCCACTTTTCCAGTCCAACAAAGTATGAGGATTAATTTGCCCCACTCCAAAGAGCTCGCCCACG 
AGCTGTGTTTGTTGCCCACCCCTGCTGTGCCCGCGCTTCCCACTGATTCTGGCGCGCAGTTTGATATCCA 
CCAGGCACTGTCCGCCTCTCTTGCCACCTATGCCCGCAACCTCACCTTGCTGTCCCACACCGCCGAGAAT 
TTAGGAAACCGCGCTCTGACCGGCCTCGCTGAAATCGAAGACACCGACGACCAACTCGCACACGCATTGG 
AGCGCCTGAC 



SEQ ID NO:9 

GGGCATCATGGGCATTGTTGGTGGATTCACCATGCCTGTCGCAATCGCTAGGACCAGGGATAAGAACCTC 
GTGTGGTTCCCCGTGGTCTTTGGTGCATCGATGTTCCTCGGTTATGTGGGAACGTGGCTGTGGCCGTCCC 
AAGGCTGGTACCTGTGGTCATTCCTTCTTGGTTTAGGTGGACTCTGCTTCCCGATGGCTATCGCCCTGAT 
TCCAGCGCGTACGAAAGATCCGAGAATTACCGCAAGCTTGTCTGGATTTGTGCAGCCGGTGGGTTACATT 
CTTGCAGCCCTTGGGCCATTGGCAGTGGGAGCGATCTACCAGGCGATTGGCTCCTGGTCAGAGATCCTCG 
TTGGTTTGGCCTTGGGCACAATAGTGTTGTCGATTGTGGGATTCAGAGCAGCACGCAATGTGACGGTTGA 
TGATGAATTGAGGAGATCAAAGTAGCCTCAACTAAGCGTCGCGATAAGAACGAGGGGCMGGCTGATGTA 
CTCTGTCAACCATGGATAAACCGGTCGTGAGGGATGCAGCTCTGCTGATTTTTCGCGCTGTGCTCGGAGT 
GATCTTTGTGGCACACGGGTGGGAAAAGCTGTTCATCTCCGGAGTTACCAAGACAACAGGACAATTTTCA 
GCCTGGGGAGTGCCTCAACCCAAGCTCTCGGTGTGGATCACATCGATCTCTGAGCTGCTCGGTGGTGCCT 
TCCTAGTGGTTGGTTTGCTCACCACCTTTGTTGCTGGTGCATTAGCGCTGTTGATCGCCGCTGCTATTTA 
CTTTGTGCACTTGAGTTCGGGCTTTTTCACAGTTGATAACGGCATCGAATTCCCCTTGCTCATCATTGTT 
TCTTTGCTCGTGATCGTTGTGTTTGGTTCTGGTAGAGCCAGCGTTGATGGGGTGCTCACGCGTGGTTGAC 
TGTAGTGCCATTCAAGCCGCGCTGTCCGCCAAATTAGATGGTGAGCCGACAGGCTTGGATGATGCAGTAA 
TTGAAGCGCACCTCGCTAATTGTGAAGAGTGCAGAAATTACTACAACCGTGCTGCTGAGTTGAATCGGAT 
GCTCAATTTTTGCGCCGCGGAACCTCGCACCCTGACCCCGCCTGATCTATCAGAGATCATTCTGGCAGAG 
GTGGAACCAGAATGGCGCAGGCATGCCAACGCCAAGGTTGTGGGATCCCTGCTATCGCGAGTGTTGTTGG 
TGATCCTGGGTGTGGTTTACCTCGCCTGGGGTATCACAATGTTGGGGGATTCGGCGTCGATAAGCGTCCA 
AGAAGACCCGCTCACCTCGCGCCTGCTCGCGGAGGCCGTTGCCTACCGCATTGCTCTTTCTGTGGGGCTG 
TTATTTGCGGCGTGGAAGCCGCGGATTATCGCGGGCATGCTCCCGATTTTTGGAACGTTGTGGACATTTA 
GTGCTGGTTTTGCTGCGCGCGATCTCGTGTTTGGCGTCGCCGATTCACAGACGGGACTGTCCATTGGTCT 
GCTGTTGATTTCTACGATTGTGCTGTCGTTTGCGTTGGTGAATAGTTCTGGACCGGGTATTTTGCGGCGC 
ACATGGAACTCATTGAACGCCGCGCCCGGCTAAGGTGGGAGGC 



SEQ ID NO:10 

TAATACGGCATCCGGCCTGAAAGGACAGTGTGGAACCCAAAAATCATGCCCTCGCAGAATCGTGTTTAAG 
GGGTTAAAACGCCTCGACCCACACTCTGACCCATCCGTGAAACTAGACCTCTTAAAACGACTTCTGGTTG 
GTGAGGTGTGAAAACCGCACTGCGGGGCCCAATTCACAAAAATCCATACGAGTGTGCACACCCGATTTTC 
ACATCGCTTCGAGACCTCCCTTTTTGACACCTTAATTGTCTAACCCCGTATAGGTGAGAAATGTTGGACA 
AGTGTCTGTTTTTGTGGGGGGAATCTGACTACGATGGTAAGAAACAGGGAAAGGGGTTACCATTATGTCT 
CAAGAGTAGCCTCAAATCGGCTCCCGCCTCTCTCGTGTCATTGAACAAGACGGCCTACAATTCCGCGATC 
TCGACGGCGACGGCGTACTTGCACCTTATG7\AGATTGGCGTCTAACCCCAGCAGAGCGTGCCGCTGACCT 
GGTGAAACGA 



SEQ ID NO:11 

CACCGCGTCGCCGATCTCTCCCTTCCAGACTCAGAAGTTGCTTCGGCAATTGCACTTGAGTGGAAACCAG 
GCGTCGGATTCACTACTTGAGCTGCTGATTTGTAGGTTTTAAGACCTTGAAGATATAGTTAATTCTCGTT 
GCAAGGAACCAGATTCCAAGCAATGCGTATTCCTCCATAGCTCAGTTGGCAGAGCATTCGACTGTTAATC 
GAAGGGTCACTGGTTCGAGCCCAGTTGGAGGAGCAAATTGAAACCCACTGTTTTTTAACAGTGGGTTTTT 
TTGCATGTTTCATACAGTTAACGAACCCCAATTTTGTAACTTCCACTCTCCTAGCTATGATGAATACTCG 
TTGCAAGGAAGTAATTCCTTTCCAATACTTATTCCTCCATAGCTCAGTTGGCAGAGCATTCGACTGTTAA 
TCGAAGGGTCACTGGTTCGAGCCCAGTTGGAGGAGCAATACACACAGCCCGCCGTTTTTCTTAAAACGGC 
GGGTTTTGTT 



SEQ ID NO:12 

GGTGTCAGTGTATCCGCGAACCAGAGCCTCGATGGCAGCTGTGTCTTCTGGGGMGCGGTCTGGCCGTTT 
TCAACTGGCGCAGCTTCAGCGAAGGTAATTGGGTTCTCAGACAGCTCTTGTGACAGTACATCAAGCTGTG 
CTTGCTGCTCTTCGTTGATGGTTGTTGCTTCGGAAGATTCAGATTCAGATTCAGATTCGGACGAGGTGGT 
TTCTGCTGCTTCCGCCGAGGTCGTTGCGCTGGAAGAGGAAGAATTTGTCGTCGTCGCTGCGCTGGATGAG 
GCTGCTTCTGTATCAGATGACTCACTGCTACATGCGGTTAAGAGAAGTGGGGTGACCATGAGTGCTGCGA 
AAGCAGCCTTCTTTGAGGAAAGGCGAATAGACAAAGTTCTGCTCCTGATAAATCATCGACATGCTCCGGA 
AAACTTAAAAATTCCCGGACGGTTCACGCAGATTACCCTAGCAAAGCAATCTAGCTGACGACCCAATTTA 
GTCCTGTCATTATGCTGGCAATTGTGCAGCTATCTAAAGAATCTATTATTGGGGCAGCCGTTTCGATCCT 
GAGCGAGTTCGGTTTGTCGGATATGACCATGCGTCGCGTCGCAAAGCAATTAAATGTCGCGCCGGGCGCG 
CTGTATTGGCATTTTAAAAATAAGCAGGAGCTTATCGACGCCACCTCACGCCATCTCCTGGCGCCTATCT 
TGGGGCGCAACGACGAGCAGCGAGCAAGCATTTCCGCGCAGGAAACGTGCGCAGAAATGCGTTCACTGAT 
GATGCAAACCAAAGACGGTGCGGAAGTCATCAGTGCCGCACTGAGTAATCAGCAACTGCGCCAAGAATTG 
GAATCTCTCATTTCCGACTCTTTAAAAGAACCTAATGAGGTCGGTGCTTTTACGCTGCTGCATTTTGTGG 
TGGGTGCAGTATTAACAGAACAAACTCAGCTGCAGATGCACGAGTTCACGGCTGGCGCGGAAGATGACAC 
ACAAGAAAACCCTGCCGATGCGAACTTTGAGGAGAGATTCAATCAAGGATTAGAAATCATTCTGGCGGGT 
CTAGACGCGCTTGGGCATATAAGATAGCGTTCT 



SEQ ID NO:13 

AGTGACAGCTCGCGCCGCATCGTTGATGGCAAACATGTCCAAATTAAGGCGCATGCGACCAAGGATGGTG 
AGCATTTGCCCTGATTCGTGGCTGAAGTACATCGAAATGCGGTGATCTTGCCACGGCACGATGATGCGAT 
CTTCTGAGCTCAAATAGTGGTAGCCCAAGGAATCAACAGCCTCCGTCACTCGATTCAGGTCAACTGGGAA 
AGGAATGGAGGTGTTGGGTGCAGGGACGTTTCGATCATTCACAGAGGTGAAATCCATTCCTTCCAGTATC 
TCAAAGGTGAAAGCGGGTTTAATTCAGGTAT^ATCTGGGGTGGTCATTTTAAGTTTTAAGTCTAATTCAAA 
TGAACTCTGATGTACCCAAATCAGAAACTTGTTACGTGGGGAATACAATAGGTAAATATGCGGGCTTAAG 
AACTTGTGTTGAGGCCGCTTGGATTCGGGCACCGAGCTCGAAGAATTTCGATTCAACCTTTTAAGGGAGA 
ACTTTTCGCC 



SEQ ID NO:14 

CGTGGCGTGCCACCCATTTAAGTCCCGCGGGGAGACTGAAGATGGTGGTGCCGTCGGCGAGGCGTTCTGC 
CTGCGTAATGGGGTTAAGTGGGATGAACGGGGGAGTCAGACGTGCGACAGCGCCCTTGCGGGTATGCCAA 
TCCCAGACCATTTCTCGGGGAAAAGGAATAAAATGGCTTGTGGTCAGACTCACAGGGGCTTCTCCAAGTC 
AGTGGATTTATGAGGTCCCAGTGGGTACACACCGGGTGTCCTACAACGATCAATTGTCACAGATTCGACT 
GGCATGCTGTACCATCTGCTTTAAGCATTTTGGTGTTTCACTGTTGTTAACAGTGTTTCACCGTGGAGCA 
CTACCTTAGATCATAGTCAGCATCTTGGGGTGAATGTGACACGGTACGCTATAGTGTCAGACAACAACCA 
GGAAACTGGTCGTTGCAGAGTTTTTGCAAAATTGGACATCCTTTAACGGACCGCACAGAGAGGCGGGGAA 
GGAGGTCACG 



SEQ ID NO:15 

GCCATGGCGTTGCGGAAATCGTAATCGGCCATTTTGTCGGTGGGGAAGAGGTGGATGTGAGTGTGGGGAA 
CATCGAATCCTGCGATGATGTAACCACATCGAGGGGCGTCGAATGCTGTGCGGATTGCATTTCCGATGAG 
CTGGGAGGCCTCGTTTACTTCGCTCCAGATGTTCTGAGGAAGGTCGGTCCAGCGGTCAACTTCTGCAACG 
GGTACGACTAGGGTGTGGCCGTAGGTGAGGGGTTCGATGGATAGAAAAGCCACGACATTCTCGGAACGAT 
ACACAAATCGGCCGGGGAGCTCGCCATTAATAATTTTCGTGAATACAGAAGCCATATGCACAGACTACTA 
CTTGGCGTGCAACCAAATTGAGGTTGCATAAAATAATGCAGGTGAGCCGGTCATTTTAAGGCGCTTTTCG 
ACGCCACTTTCAACCATTTCCGAACCGCCAAGAATACTGGAATAGCTTGGATCAAGTTTTGCAGGATAAA 
CTGTGCAACC 



SEQ ID NO:16 

ACGCTGATCGTTTTGAGCTTATCGACGCTCGCCTGCGCTCAGCTGGTTTCGATTGGTACGAGGTATCCAA 
CTGGGCGAAACCCGGCGGAGAATGCAAGCACAACATGGGCTATTGGGTCGACGGCGACTGGTGGGGTGCT 
GGCCCGGGCGCGCACTCGCACATCGGCGACCGCCGCTTCTACAACATCAAGCACCCAGCGCGTTACTCCG 
CGCAGATTGCGGCCGGCGAGCTGCCCATTAAGGAAACAGAGCGGCTGACGGCGGAAGATCACCACACCGA 
GCGCGTCATGCTTGGTTTGCGCCTGAAACAAGGCGTGCCGCTGAACCTTTTCGCACCCGCAGCGCGCCCG 
GTCATCGACCGTCATATCGCAGGAGGCCTGCTGCACGTCAATGCGCTGGGCAACCTGGCGGTGACCGATG 
CGGGACGTTTGCTTGCCGACGGCATCATCGCCGACATTTTGCTTAGTGAAGAAGACTAAATATTTAGTAG 
GGTTACAGAC 



SEQ ID NO:17 

TGGCGACTATGCCACATAGTCGACTACCTTGCATAGTTGACTATTTGATTGAGTTGAACTGTGTCAGTGT 
ATGAAGAGAAAAATGAAGAGAAAAACAGCTGCACATGGTTTCAAAAGATAAGGAAGTGAAGCATGAGCAT 
CGAGCCAGGAATCCCCACGCTTGGACCGCTTGAAGAACAAGTCATGCACATTCTGTGGGATCACGGAAAA 
TTGACAGTCCGTGAAGTCATCGAATTCCTTCCAGGTGATCCTGCGTACACAACGATCGCAACCGTCCTGC 
GTCACTTGGGCAGAAAAGGCATGGTCACCATTGTGAAAGATGGTCGGACTGCTCGACACAGCGCGTTGAT 
GAACAGGGAAGAATACACCGCTGGCGTCATGGATCAGGTGCTGTCGACCAGTCGGGATCGCACCGCATCA 
ATTCTGCATTTCGTGGATACGATCACGGCGACTGATCGCGAGCTGCTTCTGGAGTATCTGCAACAGCAGG 
AGGGCAGGAA 



SEQ ID NO:18 

AGACTTGGCGTCAAAGTTGAGTGGGACCTGTTGAAAACAGCTCGTATCTTCCCTGATTGGGTGGTTGAAA 
TTAGGGGTAAACCCGGATTTTTTCTCAAGTGAAGCGCTTTGACCTGTGTAAATTAAGAAAGTTGAGTCTA 
GTGGGAACAACTTTGTGGCATTTACCGTTGCCATATATGTAAGCTTGAGTCAGGCAGGCTCAATGAGGAG 
TTTTTCTTACCGGCGAAAGTCGGTGGAAGCAAGTCAAAGCTCAAGCCGTGGACAATACTAAAATCACCTA 
AAACAGGAGGCACCATT 



SEQ ID NO:19 

CGGGAGCCGTTCTTCTCCACGATGGGGCGCTCGGTGAAATCACTGAAACGCGCGGTTTCCAGAAGCATCA 
CCACGATGGCGTCACTGACCTCAAAAGATGCAGTGTGCTCATCACTGAAGACGGTGTTTTCTTTGAAGCC 
AAGACCTGCATAAAAAACGCTTAGATGCTGCAAGGTCAGATACTGGTAGGTTGATGAAAATCATGTCGTG 
CTGAAGTCGTGCCATGGGAGTTCCACTTCCTCAAAGTGCCTTTTGGCTAATGTGACCCCAACCAGTATGC 
TGCCGGTTTTTGGATTAGGTTTGGCCATCGTGCTAGCATATTTAGGTCTCGGCGAGGGTCAAGTACTTTT 
AGTGCTCAACCGTTATCGACGCGATCTAGACTTCTAAAGTGCACTTTTGTGCGCTGCCTGCGAAGACTCG 
ACCAAGACATTCGAGTCGGTCGCGGGCATTTTTTATTTTCGCGGCCGAGTGTCCACCTTCATCCATGAGG 
AGAAATCACT 



SEQ ID NO:20 

TGGATTGTCTGGCGGACAAAGTTCGCACCGATAAACCCGGCACCACCGGTCACAAGCAAAGAAGTCATGG 
TCGTCATAGTAGTTGCGGCTCCTGCAGTTGCGCGTGTTATCGCTGACGGGCATGTCAATTCAGCCGTGAC 
CCAGCTTCGGTGCGGTGTGAAAACGTATGGGGAAAAGCCACGGATAGTGCATGCCCACTTCTCCACATGG 
GCTGTGCTCAGCTGTCGTAGGGGCCACGGCTTGCGGACTTTCTGAACACCCGGACGAAAGCCTCCATTAT 
TGATGGCACCCACCCGGGGGTGGTATCGGTTGCTAAGTTCACAGTGGTCGGAGAGAATAAGTCAGCACTT 
CTGATTACATGTACGAAATCAGGGCGGTCAAATGCCAGTTCCCACCCGGCTTCACTAAGGCGGGACGACC 
ATAGAAACGCAGCTTCTGACATTGAATGCGCCGGTTTGCTATGGACTACACCGCATATGAGGAAAGGGCT 
TGAAACGCAC 



SEQ ID NO:21 

CTTAATGAGTACGGCGTGGCTTATATCGGTTTCGCTCCGTTTGAAATCGGCATGACCATCATCGCGCCGA 
TCGCGGTGCTCGCAGGCTTTACTATGGGGTTGGGTTGGGCGTCGTTGATTGTTGCGATCGTGATTTTTGG 
CCTCGCGTGGGGTCTGAAGTGGTTGCCGGAGCGCGGACATGTCCGCGGCGAGGGTAAGCCGCAATAAAGG 
TTGGAAGCGCCGGGTCTAGGTCCGGCGCTTCTTTCGTACGCTTTTCGACGCCTCCCTCCACGTAATATTA 
AAGTTACGGGTTTTCCCTGATGCTTAAGTGGTAGTCAGTGCTTAAACTTGACTGCGGTCCACTCAATTTA 
TTTTCAAATTTTTTGAACTTGAGTGGAACATACTCAACTCTTTGTGCGTTATAGATATTAGAGAGTTAAA 
TAATGGCGCTTGACCTGCAGGAAATTGAGATCAACACTGATTGTGTAGGTTGGCGCCCAACAAAGAAAGG 
GCGTTGAAAG 



SEQ ID NO:22 

TGGCCATTCCTGTGGTCCATGCTCGCCCTGTTCTTCTTCACTGGACTGGGCAACGCCGGCACATTCAAAC 
AAATGCCCATGATTTTGCCCAAACGCCAAGCAGGTGGCGTGATCGGCTGGACCGGTGCCATTGGTGCCTT 
CGGCCCCTTCATTGTCGGTGTCTTGCTCTCCTTCACTCCAACTGTCGCGTTCTTCTGGGGCTGCGTGGTG 
TTCTTCATCATCGCCACCGCTTTGACCTGGATCTACTACGCCCGCCCGAACGCTCCATTCCCGGGATAAA 
CCGAAAGGCCAATCCATGACTACAACTACTTCTTCTGGGAAGTCTTCTGAACAGTCTTCTGAAAAGATCA 
ACCCCCTCTTCAAGCTCGGCAGTTTCCTAAGAAAAGGCACCGTCGGTTCTGAAGGCCAGCAGATTTTCCT 
TCAGGGCGGACGCCAAGCCGATGTGTTTTTATCGCAACCCGATGGGCGTTCGATAAAAGTCGTGCGCTCC 
CACACATGGC 



In certain embodiments, polynucleotides of the invention comprise a first 
nucleic acid, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98% or 99% identical to a sequence selected from the group 
consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ 
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID 
NO:22, or a complementary sequence thereof. Of course, this embodiment also 
encompasses a first nucleic acid, the sequence of which is identical to any of the 
recited nucleic acid sequences. 
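
As an illustration of the "complementary sequence" referred to above, the following Python sketch returns the opposite strand of a DNA sequence written 5' to 3'. It assumes that "complementary" is read as the reverse complement and that the input contains only the four unambiguous bases; the function name is hypothetical.

```python
# Translation table for the four unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the complementary strand of `seq`, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


# Short synthetic example (not one of the sequences described herein).
example = "ATGCCGTA"
opposite = reverse_complement(example)
```

Applying the function twice returns the original sequence, which provides a simple check on any implementation.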

By a polynucleotide comprising a first nucleic acid, the sequence of which 
is at least, for example, 95% "identical" to a reference nucleotide sequence is 
intended that the first nucleic acid sequence is identical to the reference sequence 
except that the first nucleic acid sequence may include up to five mismatches per 
each 100 nucleotides of the reference nucleic acid sequence. In other words, to 
obtain a first nucleic acid, the sequence of which is at least 95% identical to a 
reference nucleic acid sequence, up to 5% of the nucleotides in the reference 
sequence may be deleted or substituted with another nucleotide, or a number of 
nucleotides up to 5% of the total nucleotides in the reference sequence may be 
inserted into the reference sequence. The reference (query) sequence may be any
one of the entire nucleotide sequences shown in SEQ ID NO:4, SEQ ID NO:5,
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID
NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ
ID NO:20, SEQ ID NO:21, or SEQ ID NO:22, or any fragment of any of these
sequences, as described infra.

As a practical matter, whether any particular nucleic acid sequence is at
least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to, for
instance, a nucleotide sequence selected from the group consisting of SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID
NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or a
complementary sequence thereof, can be determined conventionally using
sequence analysis computer programs such as OMIGA® Version 2.0 for
Windows, available from Oxford Molecular, Ltd. (Oxford, U.K.). OMIGA uses
the CLUSTAL W alignment algorithm using the slow full dynamic programming
alignment method with default parameters of an open gap penalty of 10 and an
extend gap penalty of 5.0, to find the best alignment between two nucleotide
sequences. When using CLUSTAL W or any other sequence alignment program
to determine whether a particular sequence is, for instance, 95% identical to a
reference sequence according to the present invention, the parameters are set, of
course, such that the percentage of identity is calculated over the full length of the
reference nucleotide sequence such that gaps, mismatches, or insertions of up to
5% of the total number of nucleotides in the reference sequence are allowed.
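
The identity criterion can be illustrated with a short calculation. The
following Python sketch is not part of OMIGA or CLUSTAL W. It assumes that a
pairwise alignment has already been produced by such a program and is supplied
as two equal-length strings in which "-" marks a gap. The function names are
hypothetical. Each aligned column in which the two sequences differ (a
substitution, a deletion, or an insertion) is counted as one difference, and the
percentage is calculated over the full length of the reference sequence, which is
one reading of the definition given above.

```python
GAP = "-"


def count_differences(aligned_reference: str, aligned_query: str) -> int:
    """Count aligned columns in which the two sequences differ (gaps included)."""
    if len(aligned_reference) != len(aligned_query):
        raise ValueError("aligned sequences must have the same length")
    pairs = zip(aligned_reference.upper(), aligned_query.upper())
    return sum(1 for ref_base, query_base in pairs if ref_base != query_base)


def percent_identity(aligned_reference: str, aligned_query: str) -> float:
    """Percent identity calculated over the full (ungapped) reference length."""
    reference_length = sum(1 for base in aligned_reference if base != GAP)
    if reference_length == 0:
        raise ValueError("reference sequence is empty")
    differences = count_differences(aligned_reference, aligned_query)
    return 100.0 * (reference_length - differences) / reference_length


def is_at_least(aligned_reference: str, aligned_query: str, threshold: float) -> bool:
    """True if the query meets the stated percent-identity threshold."""
    return percent_identity(aligned_reference, aligned_query) >= threshold


if __name__ == "__main__":
    # Toy data: a 100-nucleotide reference and a query with five substitutions.
    reference = "ACGTACGTAC" * 10
    query = list(reference)
    for index in (0, 10, 20, 30, 40):
        query[index] = "T"
    print(percent_identity(reference, "".join(query)))
```

Under these assumptions, a query that differs from a 100-nucleotide reference
at five positions is 95% identical to it, and a sixth difference would bring it
below the 95% threshold.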

This embodiment of the present invention is directed to polynucleotides
comprising a first nucleic acid, the sequence of which is at least 90%, 91%, 92%,
93%, 94%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence
selected from the group consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID
NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID
NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ
ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, and SEQ ID NO:22, or a complementary sequence thereof,
irrespective of whether they have functional activity. This is because even where
a particular polynucleotide does not have functional activity, one of skill in the
art would still know how to use the nucleic acid molecule, for instance, as a
hybridization probe, an S1 nuclease mapping probe, or a polymerase chain
reaction (PCR) primer.

Preferred, however, are polynucleotides comprising a nucleic acid, the
sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%
or 99% identical to a nucleic acid sequence selected from the group consisting of
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ
ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or a
complementary sequence thereof, which do, in fact, have functional activity, in
particular, polynucleotides which are capable of regulating transcription in
Corynebacterium species. Examples of polynucleotides which regulate
transcription include promoters, operators, transcription factor recognition sites,
transcriptional complexes, and transcriptional regulatory elements, as described
herein. Preferably, polynucleotides of the present invention comprise a promoter
which functions in Corynebacterium species, preferably in Corynebacterium
glutamicum, Brevibacterium flavum, or Brevibacterium lactofermentum, even
more preferably in C. glutamicum. Assays to determine whether a polynucleotide
is capable of regulating transcription in Corynebacterium species can be routinely
performed using techniques described herein and otherwise known in the art.

Accordingly, the present invention also encompasses the above
polynucleotide, further comprising a second nucleic acid. Preferably, the second
nucleic acid encodes one or more polypeptides, and the physical location of the
first nucleic acid relative to the second nucleic acid is such that, under the
appropriate conditions, the first nucleic acid will operably regulate transcription
of the second nucleic acid, thereby facilitating production of the one or more
polypeptides. "Facilitation of production" includes increasing production
constitutively, decreasing or blocking production except under specific
conditions, and/or increasing production except under specific conditions.
Among the "conditions" contemplated by the present invention are: (1) adding a
component to a culture medium, (2) removing a component from a culture
medium, (3) replacing one component of a culture medium with a second
component, (4) increasing the temperature of the culture medium, (5) decreasing
the temperature of the culture medium, and (6) regulating the atmospheric
conditions (e.g., oxygen or nitrogen concentrations) in which the culture medium
is maintained. Examples of such conditions are described in more detail, infra.

One particularly preferred aspect of this embodiment is a polynucleotide
comprising a nucleic acid, the sequence of which is at least 90%, 91%, 92%,
93%, 94%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence
selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID
NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ
ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22;
and a second nucleic acid, which encodes a component of an amino acid
biosynthesis pathway. Examples of amino acid biosynthesis pathways included
in this aspect of the invention are the enzymatic pathways that result in the
synthesis of L-glycine, L-alanine, L-leucine, L-methionine, L-phenylalanine,
L-tryptophan, L-lysine, L-glutamine, L-glutamic acid, L-serine, L-proline, L-valine,
L-isoleucine, L-cysteine, L-tyrosine, L-histidine, L-arginine, L-asparagine,
L-aspartic acid, and L-threonine.

A preferred amino acid biosynthesis pathway results in the synthesis of
L-lysine. In this preferred aspect of this embodiment, the second nucleic acid
encodes one of the following components of the L-lysine biosynthesis pathway:
aspartokinase, aspartate beta-semialdehyde dehydrogenase, diaminopimelate
dehydrogenase, diaminopimelate decarboxylase, dihydrodipicolinate synthetase,
dihydrodipicolinate reductase, or pyruvate carboxylase.

The present invention is further directed to a polynucleotide comprising
a first nucleic acid, the sequence of which comprises about 6 contiguous
nucleotides, preferably about 10 contiguous nucleotides, even more preferably
about 12, 15, 20, 30, 40, 50, 100, 150, 200, or in some cases up to about 500,
1000, or 1500 contiguous nucleotides, of a sequence selected from the group
consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17,
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID
NO:22, or the complement of any of these sequences. By a first nucleic acid the
sequence of which comprises about 10 contiguous nucleotides of any of said
nucleic acid sequences, for example, is intended a nucleic acid which includes
about 10 contiguous bases from any of said nucleotide sequences. Of course, the
polynucleotide includes at least 10 contiguous nucleotides of any of said nucleic
acid sequences, and may include about 12, 15, 20, 30, 40, 50, 100, 150, 200, or
in some cases up to about 500, 1000, or 1500 contiguous nucleotides, or the entire
sequence, of any of said nucleic acid sequences. In this context "about" includes
the particularly recited size, larger or smaller by several (5, 4, 3, 2, or 1)
nucleotides, at either terminus or at both termini.
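
As an illustration of the contiguous-nucleotide criterion, the following Python
sketch tests whether a candidate sequence contains a run of at least n
contiguous nucleotides taken from a reference sequence or from its complement
(read here as the reverse complement, which is an assumption). The function
names are hypothetical, and the 30-nucleotide reference is simply the beginning
of SEQ ID NO:22 as listed above, used only as sample data.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return sequence.translate(_COMPLEMENT)[::-1]


def contains_contiguous_run(candidate: str, reference: str, n: int) -> bool:
    """True if the candidate shares n contiguous nucleotides with either strand."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    candidate = candidate.upper()
    reference = reference.upper()
    # Collect every n-mer from the reference strand and from its reverse complement.
    reference_kmers = set()
    for strand in (reference, reverse_complement(reference)):
        for start in range(len(strand) - n + 1):
            reference_kmers.add(strand[start:start + n])
    # Slide an n-nucleotide window along the candidate and look for a shared n-mer.
    return any(
        candidate[start:start + n] in reference_kmers
        for start in range(len(candidate) - n + 1)
    )


if __name__ == "__main__":
    # First 30 nucleotides of SEQ ID NO:22, used here only as sample data.
    reference = "TGGCCATTCCTGTGGTCCATGCTCGCCCTG"
    probe = "NNN" + reference[5:17] + "NNN"
    print(contains_contiguous_run(probe, reference, 12))
```

The sketch tests exact runs only. The "about" tolerance described above (up to
five nucleotides at either terminus) could be approximated by calling the
function with a smaller value of n.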

Representative examples of polynucleotides of the invention include a
first nucleic acid the sequence of which comprises, for example, a sequence from
about nucleotide 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350,
351-400, 401-450, or 450-500 of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:13,
SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID
NO:19, SEQ ID NO:20, SEQ ID NO:21, or SEQ ID NO:22, or the
complementary DNA strand thereto; a sequence from about nucleotide 1-50,
51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450,
450-500, 501-550, 551-600, 601-650, 651-700, 701-750, 751-800, 801-850,
851-900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200,
1201-1250, 1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500 or
1501-1583 of SEQ ID NO:9, or the complementary DNA strand thereto; a
sequence from about nucleotide 1-50, 51-100, 101-150, 151-200, 201-250,
251-300, 301-350, 351-400, 401-450, 450-500, 501-550, 551-600, 601-650,
651-700, 701-750, 751-800, 801-850, 851-900, 901-950, 951-1000, or
1001-1083 of SEQ ID NO:12, or the
complementary DNA strand thereto; or a sequence from about nucleotide 1-50,
51-100, 101-150, 151-200, 201-250, or 251-297 of SEQ ID NO:18, or the
complementary DNA strand thereto. In this context "about" includes the
particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1)
nucleotides, at either terminus or at both termini. Additional examples of
polynucleotides of the invention include a first nucleic acid the sequence of which
comprises a consensus -10 or -35 region as listed in Table 2, infra.

This embodiment of the present invention is directed to a polynucleotide
comprising a first nucleic acid, the sequence of which comprises about 6
contiguous nucleotides, preferably about 10 contiguous nucleotides, even more
preferably about 12, 15, 20, 30, 40, 50, 100, 150, 200, or in some cases up to
about 500, 1000, or 1500 contiguous nucleotides, of a sequence selected from the
group consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7,
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID
NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and
SEQ ID NO:22, or the complement of any of these sequences, irrespective of
whether they have functional activity. This is because even where a particular
polynucleotide does not have functional activity, one of skill in the art would still
know how to use the nucleic acid molecule, for instance, as a hybridization probe,
an S1 nuclease mapping probe, or a polymerase chain reaction (PCR) primer.

Preferred, however, is a polynucleotide comprising a first nucleic acid, the
sequence of which comprises about 6 contiguous nucleotides, preferably about 10
contiguous nucleotides, even more preferably about 12, 15, 20, 30, 40, 50, 100,
150, 200, or in some cases up to about 500, 1000, or 1500 contiguous nucleotides,
of a sequence selected from the group consisting of SEQ ID NO:4, SEQ ID NO:5,
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10,
SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID
NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ
ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or the complement of any of
these sequences, which do, in fact, have functional activity, in particular,
polynucleotides which regulate transcription in Corynebacterium species.
Examples of polynucleotides which regulate transcription in Corynebacterium
species include promoters, operators, transcription factor recognition sites,
transcriptional complexes, and transcriptional regulatory elements, as described
herein.

Preferably, polynucleotides of the present invention comprise a promoter 
which functions in Corynebacterium species, preferably in Corynebacterium 
glutamicum, Brevibacterium flavum, or Brevibacterium lactofermentum, even 
more preferably in C. glutamicum. Assays to determine whether a
polynucleotide is capable of regulating transcription in Corynebacterium species 
can be routinely performed using techniques described herein and otherwise 
known in the art. 

Accordingly, the present invention also encompasses the above 
polynucleotide, further comprising a second nucleic acid. Preferably, the second 
nucleic acid encodes one or more polypeptides, and the physical location of the 
first nucleic acid relative to the second nucleic acid is such that, under the 
appropriate conditions, the first nucleic acid will operably regulate transcription 
of the second nucleic acid, thereby facilitating production of the one or more 
polypeptides, as described herein.

One particularly preferred aspect of this embodiment is a polynucleotide
comprising a first nucleic acid, the sequence of which comprises about 6
contiguous nucleotides, preferably about 10 contiguous nucleotides, even more
preferably about 12, 15, 20, 30, 40, 50, 100, 150, 200, or in some cases up to
about 500, 1000, or 1500 contiguous nucleotides, of a nucleic acid sequence
selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID
NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID
NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ
ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22;
and a second nucleic acid, which encodes a component of an amino acid
biosynthesis pathway. A preferred amino acid biosynthesis pathway results in the
synthesis of L-lysine. Examples of amino acid biosynthesis pathways, and
specific components of the L-lysine biosynthesis pathway are described herein.

The nucleotide sequences of functional bacterial promoters exhibit a
limited but significant degree of variability depending on the gene that is
regulated by the particular promoters in question, and the bacterial species in
which the promoter is found. See, e.g., Pátek et al., Microbiology 142:1297
(1996), which is incorporated herein by reference in its entirety. Despite this
variability, investigators have identified promoter consensus sequences by
comparing the nucleotide sequences of multiple bacterial promoters to one
another. A consensus sequence is a sequence that reflects the most common
nucleotides that are found in particular positions for a multitude of promoters.
Frequently, the functionality of a promoter consensus sequence is confirmed by
mutational analysis in an experimental system; i.e., the expression of a reporter
gene that is under the control of a mutated promoter is assessed.

Early experiments and analyses of E. coli promoters identified a
consensus sequence that comprises two nucleotide hexamers (six contiguous
nucleotides) separated by about 17 nucleotides, and located about 35 and 10
nucleotides, respectively, upstream of the transcriptional start site for the
corresponding gene. Pátek et al., Microbiology 142:1297 (1996). These two
hexamers are known individually as the -35 and -10 regions, and their respective
consensus sequences are TTGaca and TAtaaT. Within a consensus sequence,
capital letters denote particular nucleotides that are found in analogous positions
in at least 70% of the promoter sequences analyzed. Lower case letters denote
nucleotides that are found in analogous positions in 42% to 70% of the
promoter sequences analyzed.
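
The consensus notation can be illustrated with a short Python sketch. It takes
a set of aligned hexamers and writes, for each position, a capital letter when
the most common nucleotide occurs in at least 70% of the sequences, a lower case
letter when it occurs in at least 42% but fewer than 70%, and a period when no
nucleotide reaches 42% (the period is the no-consensus symbol used in this
document for the C. glutamicum -10 region). The function name and the ten
hexamers are hypothetical toy data; they are not the promoter sequences analyzed
by Pátek et al.

```python
from collections import Counter


def consensus(aligned_sites, upper_cutoff=0.70, lower_cutoff=0.42):
    """Build a consensus string from equal-length, pre-aligned sequences."""
    if not aligned_sites:
        raise ValueError("at least one sequence is required")
    if len({len(site) for site in aligned_sites}) != 1:
        raise ValueError("all sequences must have the same length")
    total = len(aligned_sites)
    symbols = []
    # zip(*...) walks the alignment column by column.
    for column in zip(*aligned_sites):
        base, count = Counter(b.upper() for b in column).most_common(1)[0]
        frequency = count / total
        if frequency >= upper_cutoff:
            symbols.append(base)          # strongly conserved position
        elif frequency >= lower_cutoff:
            symbols.append(base.lower())  # moderately conserved position
        else:
            symbols.append(".")           # no consensus at this position
    return "".join(symbols)


if __name__ == "__main__":
    # Hypothetical -10 hexamers, invented for illustration only.
    toy_sites = [
        "TAAAAT", "TAAAAT", "TAAAAT", "TACAAT", "TACAAT",
        "TACCAT", "TATGCT", "TATTGT", "TAGCTT", "GAGGCT",
    ]
    print(consensus(toy_sites))
```

With this toy input the first, second, and sixth positions are conserved in at
least 90% of the sequences, the fourth and fifth in 50% and 60%, and the third in
no more than 30%, so the sketch prints TA.aaT.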

Recently, Pátek et al. analyzed the sequences of 18 promoters from C.
glutamicum, and four promoters from the temperate corynephage φGA1, to
identify a promoter consensus sequence specific to C. glutamicum. From their
analysis, Pátek et al. identified a -35 consensus sequence of ttGcca, and a -10
consensus sequence of TA.aaT. The period (".") at the third position of the -10
consensus sequence indicates that the nucleotide at this position does not
establish a consensus among the C. glutamicum promoters that were compared
by Pátek et al.

Accordingly, in one aspect of this embodiment of the present invention,
a polynucleotide is provided which comprises a -10 promoter consensus
sequence, and/or a -35 consensus sequence derived from a sequence selected
from the group consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ
ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ
ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16,
SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID
NO:21, and SEQ ID NO:22. While not being bound by theory, such sequence
motifs may function as part of a promoter capable of regulating transcription in
Corynebacterium species, preferably in Corynebacterium glutamicum,
Brevibacterium flavum, or Brevibacterium lactofermentum, even more preferably
in C. glutamicum.

To identify -10 promoter and/or -35 promoter consensus sequences,
computer analyses of SEQ ID NOs:1 through 22 were conducted using the
prokaryotic nucleic acid motif search functionality provided by the OMIGA 2.0
computer software package. In addition, SEQ ID NOs:1 through 22 were
analyzed visually to identify any sequences that agree with the consensus
sequences identified by Pátek et al. A summary of predicted -10 and -35
consensus sequences identified for SEQ ID NOs:1 through 22 is listed in Table 2.
It is to be noted that in the numbering convention of SEQ ID NOs:1 through 22,
all nucleotides have a positive number and the numbers increase. This is in
contrast to the normal convention for promoter regions, wherein the nucleotides
are assigned negative numbers which increase up to nucleotide number 1 at the
putative transcription start site.

TABLE 2

SEQ ID NO:   -10 sequence(s)   Position   -35 sequence(s)   Position
----------   ---------------   --------   ---------------   --------
 1                                        TTGAGA*           375
 1                                        TTGAAA*           386
 3                                        TTGAAA*           339
 4           TACAAT*           275
 4                                        TTGAGA*            57
 4                                        TTGAAA*           186
 4                                        TTGAAA*           301
 4                                        TTGCCA*           466
 5                                        TTGAAA*           307
 5                                        TTGAGA*           340
 7           TACAAT*           334
 7                                        TTGCCA*           224
 8                                        TTGATA*           341
 8                                        TTGCCA*           371
 9                                        TTGATA*           803
10           TACAAT*           406
10                                        TTGACA*           234
11                                        TTGAAA*           248
13           TACAAT*           394
13                                        TTGCCA*           118
14           TAAAAT*           169
15           TA-AA^A.T‡        379
17                                        TTGACA*           211
18                                        TTGAAA*            31
18                                        TTGAAA*            65
18           TAAAAT*           269
18                                        TTGCCA*           168
20                                        TTGAAA*           490
21                                        TTGAAA*            41
21                                        TTGAGA*           445
21                                        TTGAAA*           494

* consensus sequences identified by the OMIGA 2.0 program
‡ consensus sequences identified visually by comparison to consensus sequences set
  forth in Pátek et al.
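
The kind of motif search summarized in Table 2, and the difference between
the two numbering conventions, can be sketched in Python as follows. This is
not the OMIGA motif search; it is a plain exact-match scan. The function names,
the toy sequence, and the transcription start site used in the example are
hypothetical. The sketch also assumes that a position denotes the 1-based index
of the first nucleotide of a hexamer, which the table itself does not state.

```python
def find_motif(sequence: str, motif: str) -> list:
    """Return the 1-based start positions of every exact occurrence of motif."""
    sequence = sequence.upper()
    motif = motif.upper()
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start + 1)                 # convert 0-based index to 1-based
        start = sequence.find(motif, start + 1)     # allow overlapping matches
    return positions


def to_promoter_numbering(position: int, start_site: int) -> int:
    """Convert a 1-based sequence position to numbering relative to the start site.

    The start site itself is numbered +1, the nucleotide immediately upstream
    is numbered -1, and there is no position 0.
    """
    if position < 1 or start_site < 1:
        raise ValueError("positions are 1-based and must be positive")
    offset = position - start_site
    return offset if offset < 0 else offset + 1


if __name__ == "__main__":
    # Toy sequence: a -35-like hexamer, a 17-nucleotide spacer, a -10-like hexamer.
    toy = "GGC" + "TTGCCA" + "C" * 17 + "TACAAT" + "GGCGCA"
    for motif in ("TTGCCA", "TACAAT"):
        print(motif, find_motif(toy, motif))
```

In the toy sequence the two hexamers start at positions 4 and 27. If the
transcription start site were at position 37 (an arbitrary choice), position 27
would correspond to -10 in the conventional promoter numbering.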



WO 02/40679 



PCT/US01/43096 



In another embodiment, the invention provides an isolated polynucleotide
comprising a first nucleic acid at least 10 nucleotides in length, preferably at least
12, 15, 30, 50 or 150 nucleotides in length, which hybridizes under stringent
conditions to a reference nucleic acid, or the complement thereof, wherein the
sequence of said reference nucleic acid is selected from the group consisting of
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ
ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22. By
"stringent hybridization conditions" is intended overnight incubation at 42°C in
a solution comprising: 50% formamide, 5x SSC (750 mM NaCl, 75 mM trisodium
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran
sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by
washing the filters in 0.1x SSC at about 65°C.
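
The salt concentrations implied by these conditions follow from simple
proportions. The Python sketch below starts from the composition given above
for 5x SSC (750 mM NaCl, 75 mM trisodium citrate) and scales it to any other
strength, such as the 0.1x wash. The function names are hypothetical, and the
20x stock used in the example is an assumption for illustration, not a value
taken from this document.

```python
# Composition of 5x SSC as stated in the text (millimolar).
SSC_5X_MM = {"NaCl": 750.0, "trisodium citrate": 75.0}


def ssc_concentrations(strength: float) -> dict:
    """Return the solute concentrations (mM) of SSC at the given strength (x)."""
    if strength <= 0:
        raise ValueError("strength must be positive")
    return {solute: mm / 5.0 * strength for solute, mm in SSC_5X_MM.items()}


def stock_volume_ml(stock_strength: float, target_strength: float,
                    final_volume_ml: float) -> float:
    """Volume of stock needed for a dilution, from C1 * V1 = C2 * V2."""
    if target_strength > stock_strength:
        raise ValueError("target cannot be more concentrated than the stock")
    return final_volume_ml * target_strength / stock_strength


if __name__ == "__main__":
    print(ssc_concentrations(0.1))
    # Assumed 20x stock, diluted to one liter of 0.1x wash buffer.
    print(stock_volume_ml(20.0, 0.1, 1000.0))
```

On this basis 1x SSC corresponds to 150 mM NaCl and 15 mM trisodium citrate,
and the 0.1x wash to about 15 mM NaCl and 1.5 mM trisodium citrate.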

By a first nucleic acid "at least 10 nucleotides in length," which hybridizes
to a reference nucleic acid is intended a first nucleic acid (either DNA or RNA) 
which hybridizes to 10 or more contiguous nucleotides from the nucleotide 
sequence of the reference nucleic acid (e.g., SEQ ID NOs 4-22). 

This embodiment of the present invention is directed to an isolated
polynucleotide comprising a first nucleic acid at least 10 nucleotides in length,
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, which hybridizes
under stringent conditions to a reference nucleic acid, or the complement thereof,
wherein the sequence of said reference nucleic acid is selected from the group
consisting of SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17,
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID
NO:22, or the complement of any of these sequences, irrespective of whether they
have functional activity. This is because even where a particular polynucleotide
does not have functional activity, one of skill in the art would still know how to
use the nucleic acid molecule, for instance, as a hybridization probe, an S1
nuclease mapping probe, or a polymerase chain reaction (PCR) primer.

Preferred, however, is an isolated polynucleotide comprising a first nucleic
acid at least 10 nucleotides in length, preferably at least 12, 15, 30, 50 or 150
nucleotides in length, which hybridizes under stringent conditions to a reference
nucleic acid, or the complement thereof, wherein the sequence of said reference
nucleic acid is selected from the group consisting of SEQ ID NO:4, SEQ ID
NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID
NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ
ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19,
SEQ ID NO:20, SEQ ID NO:21, and SEQ ID NO:22, or the complement of any
of these sequences, which do, in fact, have functional activity, in particular,
polynucleotides which regulate transcription in Corynebacterium species.
Examples of polynucleotides which regulate transcription in Corynebacterium
species include promoters, operators, transcription factor recognition sites,
transcriptional complexes, and transcriptional regulatory elements, as described
herein. Preferably, polynucleotides of the present invention comprise a promoter
which functions in Corynebacterium species, preferably in Corynebacterium
glutamicum, Brevibacterium flavum, or Brevibacterium lactofermentum, even
more preferably in C. glutamicum. Assays to determine whether a polynucleotide
is capable of regulating transcription in Corynebacterium species can be routinely
performed using techniques described herein and otherwise known in the art.

Accordingly, the present invention also encompasses the above 
polynucleotide, further comprising a second nucleic acid. Preferably, the second 
nucleic acid encodes one or more polypeptides, and the physical location of the 
first nucleic acid relative to the second nucleic acid is such that, under the 
appropriate conditions, the first nucleic acid will operably regulate transcription 
of the second nucleic acid, thereby facilitating production of the one or more 
polypeptides, as described herein.

One particularly preferred aspect of this embodiment is an isolated
polynucleotide comprising a first nucleic acid at least 10 nucleotides in length,
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, which hybridizes
under stringent conditions to a reference nucleic acid, or the complement thereof,
wherein the sequence of said reference nucleic acid is selected from the group
consisting of SEQ ID NO:1, SEQ ID NO:2,
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ
ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17,
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, and SEQ ID
NO:22; and a second nucleic acid, which encodes a component of an amino acid
biosynthesis pathway. A preferred amino acid biosynthesis pathway results in the
synthesis of L-lysine. Examples of amino acid biosynthesis pathways, and
specific components of the L-lysine biosynthesis pathway are disclosed herein.

Another embodiment of the invention provides an isolated polynucleotide
comprising a first nucleic acid, the sequence of which is at least 90%, 91%, 92%,
93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:2, and a
second nucleic acid which encodes a polypeptide which functions as a component
of the lysine biosynthesis pathway. Alternatively in this embodiment, the
sequence of the first nucleic acid comprises about 6 contiguous nucleotides,
preferably about 10 contiguous nucleotides, even more preferably about 12, 15,
20, 30, 40, 50, 100, 150, 200, or 500, contiguous nucleotides, of SEQ ID NO:2,
or the first nucleic acid is at least 10 nucleotides in length, preferably at least 12,
15, 30, 50 or 150 nucleotides in length, and hybridizes under stringent conditions
to a reference nucleic acid, or the complement thereof, wherein the sequence of
said reference nucleic acid is SEQ ID NO:2. A preferred amino acid biosynthesis
pathway results in the synthesis of L-lysine. Examples of components of the
L-lysine biosynthesis pathway are disclosed herein. In this embodiment, the first
nucleic acid, which comprises the C. glutamicum aceA promoter, regulates
transcription of the second nucleic acid.

Yet another embodiment of the invention provides an isolated
polynucleotide comprising a first nucleic acid, the sequence of which is at least
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID
NO:3; and a second nucleic acid which encodes a polypeptide which functions as
a component of an amino acid biosynthesis pathway. Alternatively in this
embodiment, the sequence of the first nucleic acid comprises about 6 contiguous
nucleotides, preferably about 10 contiguous nucleotides, even more preferably
about 12, 15, 20, 30, 40, 50, 100, 150, 200, or 500, contiguous nucleotides, of
SEQ ID NO:3, or the first nucleic acid is at least 10 nucleotides in length,
preferably at least 12, 15, 30, 50 or 150 nucleotides in length, and hybridizes
under stringent conditions to a reference nucleic acid, or the complement thereof,
wherein the sequence of said reference nucleic acid is SEQ ID NO:3. A
preferred amino acid biosynthesis pathway results in the synthesis of L-lysine.
Examples of amino acid biosynthesis pathways, and specific components of the
L-lysine biosynthesis pathway are disclosed herein. In this embodiment, the first
nucleic acid, which comprises the C. glutamicum aceB promoter, regulates
transcription of the second nucleic acid.

A further embodiment of the invention provides an isolated
Corynebacterium chromosome with a first nucleic acid integrated into the
chromosome, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%,
96%, 97%, 98%, or 99% identical to SEQ ID NO:1. By "an isolated
Corynebacterium species chromosome" is meant a bacterial chromosome which
has been altered in some way to affect gene expression, either of naturally
occurring genes or heterologous genes, and then isolated from other non-altered
chromosomes, e.g., by selection and isolation of a host cell harboring the
chromosome. In this embodiment, the isolated Corynebacterium species
chromosome also has a second nucleic acid integrated therein, wherein said
second nucleic acid encodes a polypeptide which functions as a component of an
amino acid biosynthesis pathway. A preferred amino acid biosynthesis pathway
results in the synthesis of L-lysine. Examples of amino acid biosynthesis
pathways, and specific components of the L-lysine biosynthesis pathway are
disclosed herein. In this embodiment, the first nucleic acid, which comprises the
C. glutamicum pta promoter, regulates transcription of the second nucleic acid.

Another embodiment of the invention provides an isolated 
Corynebacterium chromosome with a first nucleic acid integrated into the 
chromosome, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, or 99% identical to SEQ ID NO:23. In this embodiment, the 
isolated Corynebacterium species chromosome also has a second nucleic acid 
integrated therein, wherein said second nucleic acid encodes a polypeptide which
functions as a component of an amino acid biosynthesis pathway, preferably an
L-lysine biosynthesis pathway. In this embodiment, the first nucleic acid, which
comprises the E. coli araBAD promoter and the araC regulator molecule,
regulates transcription of the second nucleic acid. The araC/araBAD
transcriptional complex is a tightly regulated system responsive to the regulator 
molecule arabinose. 

Yet another embodiment of the invention provides an isolated 
Corynebacterium chromosome with a first nucleic acid integrated into the 
chromosome, the sequence of which is identical to either SEQ IDNO:26 or SEQ 
ID NO:27. As with the embodiments described above, the isolated 
Corynebacterium species chromosome also has a second nucleic acid integrated 
therein, wherein said second nucleic acid encodes polypeptide which functions 
as a component of an amino acid biosynthesis pathway, preferably an L-lysine 
biosynthesis pathway, and the first nucleic acid, which is a trc or tac synthetic 
promoter, regulates transcription of the second nucleic acid. These synthetic 
promoters combine the -3 5 region from the trp promoter with the - 1 0 region from 
the lac\JW5 promoter. See Brosius, J. et al, J. Biol Chem. 260, 3539 (1985). 
In the embodiment, the isolated chromosome further has a third nucleic acid 
integrated therein, the sequence of which is at least 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO:28. This third nucleic 
acid is operably linked to a transcription control region, and the nucleic acid 
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encodes a polypeptide at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, or 99% identical to SEQ ID NO:29, the R coli lac repressor gene. The 
lacl q -/rc (SEQ ID NO.30) or the lacl q -tac transcriptional complex is responsive 
to the regulatory molecules isoproplythiogalactoside (IPTG) and lactose. This 
regulatory complex in Corynebacterium glutamicum, however, is leaky, i.e., 
transcription occurs in the absence of any added inducer. 
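
The percent identity thresholds recited above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned to equal length, with gaps written as "-", and it simply counts matching columns. The function names and the example sequences are hypothetical and are not taken from the Sequence Listing; whatever method of determining identity the specification defines governs, not this simplified comparison.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Assumption: the inputs are pre-aligned, equal-length strings with gaps
    written as '-'. A gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True when the two aligned sequences are at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical 10-nucleotide sequences differing at one position.
    reference = "ACGTACGTAC"
    candidate = "ACGTACGTAA"
    print(percent_identity(reference, candidate))
    print(meets_threshold(reference, candidate, 95.0))
```

Under this simplified counting, a candidate differing from a 10-nucleotide reference at one position satisfies the 90% threshold but not the 95% threshold.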

In each of the isolated chromosomes described above, the second nucleic 
acid may be a naturally occurring Corynebacterium nucleic acid, situated in its 
normal position on the chromosome, it may be a naturally occurring 
Corynebacterium nucleic acid which has been moved to a different position on 
the chromosome, or it may be a non-native nucleic acid. The first nucleic acid is 
engineered to be in a physical location which allows it to regulate transcription 
of the second nucleic acid either through in vitro cloning of a vector having both 
the first and second nucleic acids followed by homologous or random 
recombination into the Corynebacterium species chromosome, or it is inserted in 
front of a selected second nucleic acid already situated in the chromosome 
through homologous recombination. Such manipulations are readily understood 
by those of ordinary skill in the art. 

In another aspect of the above embodiments, a Corynebacterium species 
host cell comprising a Corynebacterium species chromosome as described above 
is provided, preferably a host cell derived from Corynebacterium glutamicum, 
Brevibacterium flavum, or Brevibacterium lactofermentum, and even more
preferably derived from Corynebacterium glutamicum. In a related aspect, a 
method of producing such a Corynebacterium species host cell is provided, which 
comprises transforming Corynebacterium species cells with a vector comprising 
a first nucleic acid as described above, wherein said vector facilitates integration 
of the first nucleic acid into the chromosome of said Corynebacterium species 
cells, and selecting the host cell. In certain preferred embodiments, the vector also
comprises the second nucleic acid described above, and/or the third nucleic acid
described above, with the first nucleic acid being physically situated to regulate
transcription of the second nucleic acid. These vectors are also provided. 

The present invention further relates to variants of the polynucleotides of
the present invention. Variants may occur naturally, such as a natural allelic
variant. By an "allelic variant" is intended one of several alternate forms of a
gene occupying a given locus on a chromosome of an organism. Genes II, Lewin,
B., ed., John Wiley & Sons, New York (1985). Non-naturally occurring variants
may be produced using art-known mutagenesis techniques.

Such variants include those produced by nucleotide substitutions,
deletions or additions which may involve one or more nucleotides. Particularly
preferred are variants which alter the level of activity of the polynucleotide
relative to a non-variant polynucleotide, for example, variants which increase or
decrease the strength of a promoter, or variants which alter or eliminate the need
for an inducer. Methods to construct variant promoters and screen them for their
relative strengths utilizing reporter genes are well known in the art. See, e.g.,
Vasicova et al., J. Bacteriol. 181:6188-6199 (1999), which is incorporated herein
by reference in its entirety.

D. Vectors and Host Cells 

The present invention also relates to vectors which include 
polynucleotides of the present invention, host cells which are engineered with 
vectors of the invention and methods to increase the production of metabolic 
products such as amino acids, purines, pyrimidines, and heterologous polypeptides
through utilization of host cells of the invention. 

Host cells can be engineered to incorporate polynucleotides of the present 
invention. The polynucleotides may be introduced alone or with other 
polynucleotides. Such other polynucleotides may be introduced independently, 
co-introduced or introduced joined to the polynucleotides of the invention. 



In accordance with this aspect of the invention, the vector may be, for
example, a plasmid vector or a bacteriophage vector. Such vectors may be
introduced into Corynebacterium cells such that they are maintained and
replicate as an extrachromosomal element, or, alternatively, such that they
integrate into the chromosome. Vectors are introduced into host cells by well 
known techniques for introducing DNA and RNA into prokaryotic cells, 
examples of which are described in the Examples, infra. 

Preferred vectors comprise a first nucleic acid which is capable of
regulating transcription of a second nucleic acid which is inserted in operable
association with, preferably immediately downstream of, the first nucleic acid. The
second nucleic acid preferably encodes a polypeptide such as an enzyme in an
amino acid biosynthesis pathway. Especially preferred among vectors are those
which allow convenient insertion of the second nucleic acid into the vector in
the proper position to be regulated by the first nucleic acid, for example, through
the use of a multiple cloning region. Appropriate trans-acting factors such as
inducers either are supplied by the host cell, supplied by a complementing vector 
or supplied by the vector itself upon introduction into the host cell. 

A great variety of vectors can be used in the invention. Such vectors
include chromosomal, episomal and virus-derived vectors, e.g., vectors derived
from bacterial plasmids and from bacteriophage, as well as vectors derived from
combinations thereof, such as those derived from plasmid and bacteriophage
genetic elements, such as cosmids and phagemids. All may be used in accordance
with this aspect of the present invention. Generally, any vector suitable to
maintain and propagate a polynucleotide in a bacterial host may be used in this 
regard. 

A large number of suitable vectors for use in bacteria are known, many
of which are commercially available. Preferred prokaryotic vectors include
plasmids such as those capable of replication in E. coli (such as, for example,
pBR322, ColE1, pSC101, pACYC 184, and πVX). Such plasmids are, for example,
disclosed by Maniatis, T., et al., In: Molecular Cloning, A Laboratory Manual,
Cold Spring Harbor Press, Cold Spring Harbor, NY (1982). The following
vectors are provided by way of example: pET (Novagen); pQE70, pQE60, pQE-9
(Qiagen); pBs, phagescript, psiX174, pBlueScript SK, pBsKS, pNH8a, pNH16a,
pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540,
pRIT5 (Pharmacia).

Preferred vectors for the isolated polynucleotides of the invention include 
the pFC1 to pFC7 novel family of combinatorial cloning vectors (Lonsdale, D.M.,
et al., Plant Molecular Biology Reporter 13: 343-345 (1995)) and the pK184 vector
(Jobling, M.G. and Holmes, R.K., Nucleic Acids Research 18: 5315-5316 (1990)).

Another group of preferred vectors are those that are capable of 
autonomous replication in Corynebacterium species. Such vectors are well 
known to those skilled in the art of amino acid production by way of microbial 
fermentation, examples of which include pSR1, pMF1014a and vectors derived
therefrom. 

The second nucleic acid encoding a polypeptide is operatively associated
with the first nucleic acid as described above, which provides the appropriate
transcription control sequence(s), in particular, a promoter. In addition, the
vector may contain sites for transcription initiation and termination, and, in the
transcribed region, a ribosome binding site for translation. The second nucleic
acid, which provides the coding region of the polypeptide to be regulated, will
include a translation initiating AUG or GUG at the beginning and a termination
codon (UAA, UGA or UAG) appropriately positioned at the end of the coding
region.
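
The requirements just stated for the coding region (an initiating AUG or GUG, a terminating UAA, UGA or UAG, and a reading frame between them) can be checked mechanically. The following is a minimal sketch, not a procedure from the specification: the function name and the test sequences are hypothetical, and treating an internal in-frame stop codon as an error is an added assumption about what "appropriately positioned" implies.

```python
START_CODONS = {"AUG", "GUG"}          # initiation codons named in the text
STOP_CODONS = {"UAA", "UGA", "UAG"}    # termination codons named in the text


def is_well_formed_coding_region(seq: str) -> bool:
    """Check start codon, stop codon, and reading frame of a coding region.

    Accepts RNA or DNA letters (T is read as U). Returns False if the length
    is not a multiple of three, if the first codon is not AUG/GUG, if the
    last codon is not a stop codon, or if a stop codon occurs in frame
    before the end (an assumption of this sketch).
    """
    rna = seq.upper().replace("T", "U")
    if len(rna) < 6 or len(rna) % 3 != 0:
        return False
    codons = [rna[i:i + 3] for i in range(0, len(rna), 3)]
    if codons[0] not in START_CODONS:
        return False
    if codons[-1] not in STOP_CODONS:
        return False
    return not any(codon in STOP_CODONS for codon in codons[1:-1])


if __name__ == "__main__":
    # Hypothetical toy sequences, far shorter than any real gene.
    for candidate in ("AUGGCUUAA", "GTGGCTTGA", "AUGUAAGCUUAA", "CUGGCUUAA"):
        print(candidate, is_well_formed_coding_region(candidate))
```

A check of this kind is only a sanity test on a candidate insert; it says nothing about whether the ribosome binding site or promoter spacing is functional.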

In addition, and as described in more detail herein, the vector may contain 
control regions that regulate as well as engender expression. Generally, such 
regions will operate by controlling transcription, such as inducer or repressor 
binding sites and enhancers, among others. 

Vectors of the present invention generally will include a selectable 
marker. Such markers also may be suitable for amplification or the vectors may 
contain additional markers for this purpose. In this regard, vectors preferably
contain one or more selectable marker genes to provide a phenotypic trait for
selection of transformed host cells. Such markers include, but are not limited to,
an antibiotic resistance gene such as a chloramphenicol, ampicillin, or kanamycin
resistance gene, or an autotrophic gene which allows the host cell to grow in the
absence of a nutrient for which the host cell strain is normally auxotrophic.

If the vector is intended to be maintained in the host cell
extrachromosomally, it will contain, in addition, an origin of replication which
will allow it to replicate in the Corynebacterium species host cell. Alternatively,
if it is desired that the vector integrate into the Corynebacterium species
chromosome, the vector is constructed such that it cannot replicate in
Corynebacterium. For example, such a vector might be capable of propagation
in another organism, for example, E. coli, but lack the proper origin of replication
to be propagated in Corynebacterium. In another aspect of this embodiment, the
vector is a shuttle vector which can replicate and be maintained in more than one
host cell species. For example, such a shuttle vector might be capable of
replication in a Corynebacterium host cell such as a C. glutamicum host cell, and
also in an E. coli host cell.

A vector comprising a polynucleotide as described elsewhere herein may
be introduced into an appropriate Corynebacterium species host cell using a
variety of well known techniques, for example, transformation or electroporation.
Appropriate culture media and conditions for culturing Corynebacterium host
cells are known in the art.

The invention provides methods for increasing amino acid production and
processes for the production of an amino acid wherein increased biosynthetic
pathway gene expression is accomplished through insertion of an isolated
polynucleotide into the chromosome of a host cell. For example, insertion of
isolated polynucleotides into the chromosome of Corynebacterium species may
be done utilizing the pK184 plasmid described by Jobling, M. et al., Nucleic
Acids Research 18(17): 5315-5316 (1990). Because these vectors lack a
Corynebacterium species origin of replication and contain a selectable marker
such as kanamycin (kanr), cells will only be capable of growing under selection
if the vector has been inserted into the host cell chromosome by homologous
recombination.

The invention also provides methods for increasing amino acid production
and processes for the production of an amino acid wherein increased biosynthetic
pathway gene expression is accomplished through the introduction into a host cell
of a self-replicating, extra-chromosomal vector, e.g., a plasmid, comprising an
isolated nucleic acid molecule encoding an amino acid biosynthetic pathway gene
or genes. Suitable plasmids for these embodiments include pSR1 and other
derivatives of pSR1 (Archer, J. et al., J. Gen. Microbiol. 139: 1753-1759 (1993)).

These vectors are listed solely by way of illustration of the many vectors
available to those of skill in the art. Other vectors are described in the Examples,
infra. Selection of appropriate vectors to transform into a host cell is a well
known procedure and the requisite techniques for vector construction,
introduction of the vector into the host and maintenance in the host are routine
skills in the art.

The present invention also relates to Corynebacterium species host cells
comprising the vector constructs described herein. The host cell
can be any Corynebacterium species, preferably Corynebacterium glutamicum,
Brevibacterium flavum, or Brevibacterium lactofermentum, even more preferably
a strain of C. glutamicum. Preferred strains of C. glutamicum to use as host cells
of the invention include: NRRL-B11474, ATCC 21799, ATCC 21529, ATCC
21543, and E12.

Introduction of a vector into the host cell can be effected by
transformation, electroporation, transduction, infection or other methods. Such
methods are described in many standard laboratory manuals, such as Davis et al.,
Basic Methods in Molecular Biology (1986).

E. Methods and Processes of the Invention

Various embodiments of the invention provide methods to increase the 
production of an amino acid and processes for the production of an amino acid 
from a Corynebacterium species host cell. Particularly preferred 
Corynebacterium species of the methods and processes of the invention include: 
Corynebacterium glutamicum, Brevibacterium flavum, Brevibacterium 
lactofermentum and other Corynebacteria and Brevibacteria species known in
the art. 

Amino acid biosynthetic pathway genes embodied by the methods and 
processes described herein include those for L-glycine, L-alanine, L-methionine,
L-phenylalanine, L-tryptophan, L-proline, L-serine, L-threonine, L-cysteine,
L-tyrosine, L-asparagine, L-glutamine, L-aspartic acid, L-glutamic acid, L-lysine,
L-arginine, L-histidine, L-isoleucine, L-leucine, and L-valine biosynthesis.
Particularly preferred embodiments are drawn to biosynthetic pathway genes for
L-lysine (Sahm et al., Ann. N. Y. Acad. Sci. 782: 25-39 (1996)), L-threonine,
L-isoleucine, L-tryptophan, and L-valine.

By way of example, the amino acid pathway for L-lysine biosynthesis is 
well known to skilled artisans of amino acid production in Corynebacterium 
species. Genes encoding the enzymes important for the conversion of L-aspartate 
to L-lysine include the ask, asd, dapA, dapB, ddh, and lysA genes, which encode, 
respectively, aspartokinase, aspartate beta-semialdehyde dehydrogenase, 
dihydrodipicolinate synthetase, dihydrodipicolinate reductase, diaminopimelate 
dehydrogenase, diaminopimelate decarboxylase, and pyruvate carboxylase. Thus, 
the invention provides herein, for exemplary purposes only, specific embodiments
utilizing L-lysine biosynthetic pathway genes. Other embodiments drawn to the 
use of biosynthetic pathway genes for the synthesis of other amino acids are also 
encompassed by the invention described herein. 

The methods to increase the production of a biosynthetic product such as 
an amino acid and the processes for the production of the biosynthetic product of
the invention involve increasing the expression of at least one biosynthetic
pathway gene. Preferred methods of increasing expression comprise using 
promoters of the invention. 

Accordingly, the present invention provides a first nucleic acid which 
preferably comprises a promoter. Preferred promoters include, but are not limited
to, the following types of promoters: a constitutive promoter, an inducible
promoter, a derepressable promoter, a heat sensitive promoter, and a cold
sensitive promoter. In this aspect of the invention, a constitutive promoter causes
the second nucleic acid to be transcribed at a substantially constant rate; an 
inducible promoter will be non-functional or will function at a reduced level 
under certain standard culture conditions, but causes elevated transcription of the 
second nucleic acid when the transformed host cells are grown in the presence of 
an appropriate inducer substance; a derepressable promoter causes elevated 
transcription of the second nucleic acid when the transformed host cells are 
grown in the absence of an appropriate suppressor substance, but will be non- 
functional or will function at a reduced level when the suppressor substance is 
present; a heat sensitive promoter causes elevated transcription of the second 
nucleic acid when the transformed host cells are grown in a medium the
temperature of which is adjusted to a temperature that is greater than the optimum 
temperature for wild-type untransformed host cells; and a cold sensitive promoter 
causes elevated transcription of the second nucleic acid when said transformed 
host cells are grown in a medium the temperature of which is adjusted to a 
temperature that is less than the optimum temperature for a wild-type 
untransformed host cell. 

Also included within this embodiment of the invention are methods 
wherein the transformed host cells are grown to a sufficient cell density according 
to methods well known in the art. Further included in this embodiment are 
methods for substantially purifying the biosynthetic product from said culture, 
wherein said purification methods are also well known in the art.

In certain embodiments of the present invention, an inducible promoter
is used. When an inducible promoter is used, it is preferred that the transformed 
Corynebacterium species host cells are first allowed to proliferate to a sufficient 
cell density in the absence of the inducer, followed by addition of the inducer to 
the culture medium thereby stimulating production of said product. Examples of 
inducers include, but are not limited to, one or more of the following substances:
acetic acid, pyruvate, ethanol, fatty acids, cellulose subunits, starch subunits, 
triglycerides, or any of the following sugars: fructose, maltose, lactose, or 
arabinose. 

In other embodiments, a derepressable promoter is used. When a 
derepressable promoter is used, it is preferred that the transformed 
Corynebacterium species host cells are first allowed to proliferate to a sufficient 
cell density in the presence of the suppressor, followed by either removal of the 
suppressor from the culture, or replacement of the medium containing the 
suppressor with medium lacking the suppressor, thereby stimulating production
of said product. Under certain conditions, the suppressor is removed from the 
medium by being metabolized by the host cell. Examples of said suppressors 
include, but are not limited to, any of the following substances: a purine, a
pyrimidine, oxygen, or a suppressor amino acid including histidine. 

In other embodiments, the promoter is self-inducing, i.e., the inducer
molecule is produced by the host cell and builds up through the course of the
growth, thereby changing gene expression as growth progresses. In this
embodiment, the promoter is induced either gradually or at some point in the
growth of the host cell culture; however, this induction does not require the
addition of exogenous compounds to the culture medium.

In yet another embodiment, a heat sensitive promoter is used. When a 
heat sensitive promoter is used, it is preferred that the transformed host cells are 
first allowed to proliferate to a sufficient cell density in culture medium, the 
temperature of which is adjusted to the optimum growth temperature for 
untransformed host cells, followed by adjusting the temperature of said culture
medium to a temperature that is greater than the optimum growth temperature for
untransformed host cells.

In yet another embodiment, a cold sensitive promoter is used. When a 
cold sensitive promoter is used, it is preferred that the transformed host cells are 
first allowed to proliferate to a sufficient cell density in culture medium, the 
temperature of which is adjusted to the optimum growth temperature for 
untransformed host cells, followed by adjusting the temperature of said culture 
medium to a temperature that is less than the optimum growth temperature for 
untransformed host cells.

In a related embodiment of the invention, a method is provided for the 
production of a product such as an amino acid, a purine nucleotide, or a 
heterologous polypeptide in a Corynebacterium species host cell utilizing a 
heterologous transcriptional complex. According to this embodiment, the host 
cell harbors the heterologous transcriptional complex integrated into the bacterial 
chromosome. Preferred heterologous transcriptional complexes include the E. 
coli araC-araBAD transcriptional complex (SEQ ID NO:23), and the lacIq-trc or
lacIq-tac transcriptional complexes which combine the E. coli lacIq repressor gene
(lacIq, SEQ ID NO:28) with either the trc promoter (SEQ ID NO:26 or SEQ ID
NO:30) or the tac promoter (SEQ ID NO:27). These synthetic promoters
combine the -35 region from the trp promoter with the -10 region from the
lacUV5 promoter. See Brosius, J. et al., J. Biol. Chem. 260:3539 (1985).

The method utilizing the E. coli araC-araBAD transcriptional complex 
preferably further comprises culturing the host cell in or on a medium to which 
the inducer arabinose has been added, either throughout growth of the host cells, 
or at a point during growth where the host cell culture has reached an optimal 
density. The method utilizing either the lacIq-trc or the lacIq-tac transcriptional
complex may further comprise culturing the host cell in or on a medium to which
the inducer isopropylthiogalactoside (IPTG) or the inducer lactose has been
added, either throughout growth of the host cells, or at a point during growth
where the host cell culture has reached an optimal density. However, in C.
glutamicum, the lacIq/trc system is leaky; therefore, an inducer is not required for
optimal growth.

A preferred aspect of this embodiment provides a method for the 
production of an amino acid in a Corynebacterium host cell utilizing a
heterologous transcriptional complex integrated into the host cell chromosome,
where the heterologous transcriptional complex operably regulates production of a
polypeptide which functions in an amino acid biosynthesis pathway. In this
aspect, the heterologous transcriptional complex, when integrated into the host 
cell chromosome, regulates transcription of one or more metabolic pathway 
enzymes, thereby resulting in elevated production of the amino acid by the 
transformed host cell. 

One aspect of this embodiment involves addition of an inducer to the 
culture medium thereby upregulating the heterologous transcriptional complex. 
In the method utilizing the E. coli araC-araBAD transcriptional complex,
arabinose is added to the medium. In the method utilizing the lacIq-trc or the
lacIq-tac transcriptional complex, IPTG and/or lactose is added to the culture
medium. Preferably in this method, the host cells are grown to a sufficient cell 
density according to methods well known in the art prior to addition of the 
inducer molecule. 

For those embodiments of the invention drawn to a method to increase 
production of an amino acid, screening for increased production of an amino acid,
for example L-lysine, may be performed by directly comparing the amount of
L-lysine produced in culture by a standard Corynebacterium species strain to that
of a Corynebacterium species host cell which has been transformed to comprise
an isolated polynucleotide of the present invention. The level of production of
the amino acid of choice may conveniently be determined by the following formula
to calculate the percent yield from dextrose: [(g amino acid/L) / (g dextrose
consumed/L)] * 100.
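
The percent yield formula above translates directly into a short calculation. The sketch below is illustrative only: the function name is hypothetical and the titers in the usage example are made-up numbers, not results from any fermentation described herein.

```python
def percent_yield_from_dextrose(amino_acid_g_per_l: float,
                                dextrose_consumed_g_per_l: float) -> float:
    """Percent yield = (g amino acid/L) / (g dextrose consumed/L) * 100."""
    if dextrose_consumed_g_per_l <= 0:
        raise ValueError("dextrose consumed must be greater than zero")
    if amino_acid_g_per_l < 0:
        raise ValueError("amino acid titer cannot be negative")
    return 100.0 * amino_acid_g_per_l / dextrose_consumed_g_per_l


if __name__ == "__main__":
    # Hypothetical titers for a parent strain and a transformed host cell,
    # each having consumed the same amount of dextrose.
    parent = percent_yield_from_dextrose(25.0, 100.0)
    transformed = percent_yield_from_dextrose(30.0, 100.0)
    print(f"parent: {parent:.1f}%  transformed: {transformed:.1f}%")
```

Comparing the two values for cultures grown under the same conditions is the direct comparison the paragraph above describes.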

A variety of media known to those skilled in the art may be used to 
support cell growth for the production of an amino acid. Illustrative examples of
suitable carbon sources include, but are not limited to: carbohydrates, such as
glucose, fructose, sucrose, starch hydrolysate, cellulose hydrolysate and molasses; 
organic acids, such as acetic acid, propionic acid, formic acid, malic acid, citric 
acid, and fumaric acid; and alcohols, such as glycerol. Illustrative examples of 
suitable nitrogen sources include, but are not limited to: ammonia, including 
ammonia gas and aqueous ammonia; ammonium salts of inorganic or organic 
acids, such as ammonium chloride, ammonium phosphate, ammonium sulfate and 
ammonium acetate; and other nitrogen-containing sources, including meat 
extract, peptone, corn steep liquor, casein hydrolysate, soybean cake hydrolysate, 
urea and yeast extract. 

A variety of fermentation techniques are known in the art which may be 
employed in processes of the invention drawn to the production of amino acids. 
Generally, amino acids may be commercially produced using the invention in
fermentation processes such as those of the batch type or of the fed-batch type.
In batch type fermentations, all nutrients are added at the beginning of the fermentation.
In fed-batch or extended fed-batch type fermentations one or a number of
nutrients are continuously supplied to the culture, right from the beginning of the
fermentation or after the culture has reached a certain age, or when the nutrient(s)
which are fed have been exhausted from the culture fluid. A variant of the extended
batch or fed-batch type fermentation is the repeated fed-batch or fill-and-draw
fermentation, where part of the contents of the fermenter is removed at some 
time, for instance when the fermenter is full, while feeding of a nutrient is 
continued. In this way a fermentation can be extended for a longer time. 

Another type of fermentation, the continuous fermentation or chemostat 
culture, uses continuous feeding of a complete medium, while culture fluid is 
continuously or semi-continuously withdrawn in such a way that the volume of 
the broth in the fermenter remains approximately constant. A continuous 
fermentation can in principle be maintained for an infinite time. 

In a batch fermentation an organism grows until one of the essential 
nutrients in the medium becomes exhausted, or until fermentation conditions
become unfavorable (e.g. the pH decreases to a value inhibitory for microbial
growth). In fed-batch fermentations measures are normally taken to maintain 
favorable growth conditions, e.g. by using pH control, and exhaustion of one or 
more essential nutrients is prevented by feeding these nutrient(s) to the culture. 
The microorganism will continue to grow, at a growth rate dictated by the rate of 
nutrient feed. Generally a single nutrient, very often the carbon source, will 
become limiting for growth. The same principle applies for a continuous
fermentation: usually one nutrient in the medium feed is limiting, and all other
nutrients are in excess. The limiting nutrient will be present in the culture fluid 
at a very low concentration, often immeasurably low. Different types of nutrient 
limitation can be employed. Carbon source limitation is most often used. Other 
examples are limitation by the nitrogen source, limitation by oxygen, limitation 
by a specific nutrient such as a vitamin or an amino acid (in case the 
microorganism is auxotrophic for such a compound), limitation by sulphur and 
limitation by phosphorus.
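
For the continuous fermentation case, the statement that the growth rate is dictated by the rate of nutrient feed can be made concrete with the standard chemostat relation, which is textbook background rather than part of this disclosure: at steady state the specific growth rate equals the dilution rate, D = F/V, where F is the medium feed rate and V the broth volume. The sketch below assumes that relation; the function names and the example feed rate and volume are hypothetical.

```python
import math


def dilution_rate_per_h(feed_l_per_h: float, broth_volume_l: float) -> float:
    """Dilution rate D = F / V (per hour) for a constant-volume culture."""
    if broth_volume_l <= 0:
        raise ValueError("broth volume must be greater than zero")
    if feed_l_per_h < 0:
        raise ValueError("feed rate cannot be negative")
    return feed_l_per_h / broth_volume_l


def doubling_time_h(specific_growth_rate_per_h: float) -> float:
    """Doubling time ln(2) / mu for a given specific growth rate mu."""
    if specific_growth_rate_per_h <= 0:
        raise ValueError("specific growth rate must be greater than zero")
    return math.log(2) / specific_growth_rate_per_h


if __name__ == "__main__":
    # Hypothetical example: 0.5 L/h of complete medium into 10 L of broth.
    d = dilution_rate_per_h(0.5, 10.0)
    # At steady state in a chemostat, the growth rate is assumed equal to D.
    print(f"D = {d:.3f} per h, doubling time = {doubling_time_h(d):.1f} h")
```

The relation holds only while the dilution rate stays below the maximum growth rate of the organism; above that, cells wash out of the fermenter.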

The amino acid may be recovered by any method known in the art. 
Exemplary procedures are provided in the following: Van Walsem, H.J. &
Thompson, M.C., J. Biotechnol. 59: 127-132 (1997), and U.S. Pat. No. 3,565,951,
both of which are incorporated herein by reference. 

Examples 
Example 1 

Production of Host Cells Utilizing Transcriptional Regulatory Regions of the Present Invention

Corynebacterium species host cells in which to express metabolic
polypeptides are constructed as follows. Polynucleotides of the present invention
which comprise transcriptional regulatory regions related to SEQ ID NOs 1-23,
26-28, and 30 are inserted upstream of a multiple cloning region, thereby
allowing convenient insertion of coding regions to be regulated and expressed.
Testing of the various transcriptional regulatory regions is conveniently carried
out by inserting the various regulatory regions in operable association with a known
reporter gene, such as β-galactosidase (lacZ). Methods and techniques common
to the art of recombinant DNA technology are used in making vectors of the 
invention, as may be found in the many laboratory manuals cited and incorporated 
herein, for example as found in J. Sambrook, E.F. Fritsch and T. Maniatis, 
Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). Construction of 
exemplary vectors is set forth in Example 2, infra. 

For cell transformation experiments with the isolated nucleic acid
molecules of the invention, the growth and preparation of competent cells may
be done according to the following procedure: (1) picking a fresh, single colony
of Corynebacterium glutamicum and growing a culture overnight in 10 mL CM
(SMI) in a 250 mL shake flask at 30 degrees Celsius with agitation; (2)
inoculating 200 mL of "Growth Media" with the overnight culture to an optical
density (O.D.) at 660 nm of 0.1 in a 500 mL shake flask; (3) growing the culture
at 30 degrees Celsius with agitation for 5-6 hours; (4) pouring the culture into a
chilled, sealed, sterile 250 mL centrifuge bottle and spinning at 8-10K for ten
minutes in a refrigerated Sorvall centrifuge at 4 degrees Celsius; (5) pouring off
the supernatant thoroughly and resuspending the cell pellet in an equal volume of
ice-cold, sterile, deionized water; (6) centrifuging the sample again under the same
conditions; (7) repeating the water wash, remembering to keep everything
ice-cold; (8) pouring off the supernatant thoroughly and resuspending the cell
pellet in 1 mL of ice-cold, sterile 10% glycerol and transferring the cells to a
chilled, sterile, 1.5 mL microcentrifuge tube; (9) spinning the sample for 10 minutes
in a refrigerated centrifuge; (10) pipetting off and discarding the supernatant, and
resuspending the pellet in two to three times the pellet volume (200-400 µL) of
10% glycerol; and (11) aliquoting, if necessary, the cells into chilled tubes and
freezing at -70 degrees Celsius.

Plasmid DNAs are introduced into Corynebacterium glutamicum host
cells by the following electroporation procedure: (1) pipetting 35 µL cell/glycerol
solution onto the side wall of a chilled 0.1 cm electrocuvette; (2) pipetting about
2-4 µL of plasmid into the solution and mixing the sample by gentle pipetting up
and down; (3) bringing the entire solution to the bottom of the electrocuvette by
gentle tapping, avoiding the creation of bubbles; and (4) keeping the sample on ice
until ready for the electroshock step, wiping off any moisture on the outside of the
electrocuvette prior to the electroshock administration, and shocking the cells one
time at 1.5 kV, 200 Ω, 25 µF.
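
Two derived quantities are often used to compare electroporation settings: the field strength across the cuvette gap and the nominal time constant of the pulse. The sketch below computes both, taking the pulse settings to be 1.5 kV, 200 Ω and 25 µF in a 0.1 cm cuvette. The function names are hypothetical, and the time constant is the simple product of resistance and capacitance, which is an idealization that ignores the resistance of the sample itself.

```python
def field_strength_kv_per_cm(voltage_kv: float, gap_cm: float) -> float:
    """Nominal field strength E = V / d across the cuvette gap."""
    if gap_cm <= 0:
        raise ValueError("cuvette gap must be greater than zero")
    return voltage_kv / gap_cm


def nominal_time_constant_ms(resistance_ohm: float, capacitance_uf: float) -> float:
    """Nominal time constant tau = R * C.

    Ohms times microfarads gives microseconds; dividing by 1000 converts
    the result to milliseconds.
    """
    if resistance_ohm <= 0 or capacitance_uf <= 0:
        raise ValueError("resistance and capacitance must be greater than zero")
    return resistance_ohm * capacitance_uf / 1000.0


if __name__ == "__main__":
    e_field = field_strength_kv_per_cm(1.5, 0.1)
    tau = nominal_time_constant_ms(200.0, 25.0)
    print(f"field strength ~ {e_field:.1f} kV/cm, time constant ~ {tau:.1f} ms")
```

With these settings the nominal values work out to about 15 kV/cm and 5 ms; the time constant actually reported by an instrument may be shorter if the sample is conductive.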

Cells are allowed to recover from electroporation by: (1) immediately 
pipetting 1 mL of warm "Recovery Media" into the electrocuvette and thoroughly 
mixing the solution by pipetting; (2) incubating the solution (in the 
electrocuvette) at 30 degrees Celsius for at least three hours for antibiotic 
resistance expression and cell recovery; and (3) plating on selection media and
incubating at 30 degrees Celsius for 3 days. 

Example 2 

Preparation of L-Lysine Pathway Constructs Utilizing the Regulatory Regions of the Present Invention

Isolated nucleic acid molecules encoding L-lysine amino acid biosynthesis 
pathway genes are isolated by methods known to those of ordinary skill in the art. 

Constructs which facilitate expression and regulation of L-lysine amino 
acid biosynthesis pathway genes are introduced into Corynebacterium strains by 
methods such as those described in Example 1. Any strain of Corynebacterium,
particularly that of Corynebacterium glutamicum, may be utilized for the isolation
of nucleic acid molecules for use in the preparation of vectors to improve amino
acid biosynthetic pathway gene expression. Particularly preferred strains include:
NRRL-B11474, ATCC 21799, ATCC 21529, ATCC 21543, and E12. As one
skilled in the art would know, the invention is not limited to these specific strain
origins.

Methods and techniques common to the art of recombinant DNA
technology are used in making vectors of the invention, as may be found in the 
many laboratory manuals cited and incorporated herein. 

The polymerase chain reaction (PCR) technique may be used in the 
making of vectors of the invention. In a typical reaction, the standard 10X stock 
solution (100 mM Tris-HCl, pH 8.3, 500 mM KCl, 1.5 mM MgCl2) is diluted 
to 1X for use. Typical reaction conditions were used for PCR amplification: 10 
mM Tris, pH 8.3, 50 mM KCl, 1.5 mM MgCl2, 0.01% gelatin, 200 µM 
deoxynucleotides, 0.2-1.0 µM primers and 2.0 U/100 µL Pfu polymerase. Standard 
cycling parameters were also employed in PCR reactions: for 30 cycles, template 
denaturation was performed at 94 °C for 30 sec; annealing was performed at 
65 °C for 1 min (or at the annealing temperature appropriate for the particular 
primer pair); and product extension was performed at 72 °C for 1 min (if the 
product is <500 bp) or 3 min (if the product is >500 bp). At the end of cycling, a 
final extension at 72 °C for 7 min was performed. 
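
The cycling parameters above can be written out as an explicit program, for example as in the following minimal sketch. The names `Step`, `extension_seconds`, `cycling_program`, and `total_hold_minutes` are hypothetical, the product lengths in the usage note are illustrative only, and the handling of a product of exactly 500 bp is an assumption because the text does not specify it.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    seconds: int


def extension_seconds(product_bp: int) -> int:
    """1 min for products under 500 bp, 3 min for longer products.

    Assumption: a product of exactly 500 bp gets the longer time.
    """
    return 60 if product_bp < 500 else 180


def cycling_program(product_bp: int, anneal_c: int = 65, cycles: int = 30) -> List[Step]:
    """Build the full list of hold steps described in the text."""
    one_cycle = [
        Step("denaturation", 94, 30),
        Step("annealing", anneal_c, 60),  # adjust anneal_c per primer pair
        Step("extension", 72, extension_seconds(product_bp)),
    ]
    return one_cycle * cycles + [Step("final extension", 72, 7 * 60)]


def total_hold_minutes(program: List[Step]) -> float:
    """Sum of hold times only; ramping between temperatures is ignored."""
    return sum(step.seconds for step in program) / 60


if __name__ == "__main__":
    for length in (400, 1200):  # illustrative product lengths
        minutes = total_hold_minutes(cycling_program(length))
        print(f"{length} bp product: {minutes:.0f} min of hold time")
```

For a 400 bp product the hold times sum to 82 minutes, and for a 1200 bp product to 142 minutes, not counting temperature ramps.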

(a) Construction of pTrcLacZ 

A plasmid construct designed to replicate as a plasmid in 
Corynebacterium, which contains the β-galactosidase (lacZ) coding region 
operably associated with the lacIq/trc regulatory region, was constructed as 
follows. Plasmid pZap2, a plasmid which is capable of replicating in 
Corynebacterium, was digested with XbaI and SmaI, which linearize the plasmid 
just upstream of the lacZ gene. A cassette containing lacIq and the trc promoter 
was excised from plasmid pTrc99A, available from Amersham Pharmacia 
Biotech, by digestion with SphI, filling in the overhang, and then digesting with 
XbaI. The 1517-bp cassette (SEQ ID NO:30) was isolated from an agarose gel, 
and ligated into the linearized pZap2 plasmid. The resulting ligation mix was 
used to transform Escherichia coli, and colonies were isolated from LB plates 
supplemented with ampicillin (100 µg/ml), Xgal, and IPTG (LBXIA medium). 
Blue colonies were selected and plasmid isolates were verified by restriction 
digestion and electrophoresis. 

(b) Construction of pAraBADLacZ 

A plasmid construct designed to replicate as a plasmid in 
Corynebacterium, which contains the lacZ coding region operably associated 
with the araC/araBAD regulatory region, was constructed as follows. Plasmid 
pZap2 was digested with XbaI and SmaI as in (a). A cassette containing the araC 
gene and the pBAD promoter was excised from plasmid pBAD18-Cm, available 
from the American Type Culture Collection, Catalog No. 87396, by digestion 
with ClaI followed by a fill-in reaction and then digestion with XbaI. The cassette 
(i.e., nucleotides 1 to 1338 of SEQ ID NO:23) was isolated from an agarose gel 
and was ligated into the linearized pZap2 plasmid. Transformed E. coli colonies 
were plated on LBXIA medium as in (a), blue colonies were selected, and 
plasmid isolates were verified by restriction digestion and electrophoresis. 

(c) Construction of pAceB10Zap2 

A plasmid construct designed to replicate as a plasmid in 
Corynebacterium, which contains the lacZ coding region operably associated 
with the C. glutamicum aceB regulatory region, was constructed as follows. A 
cassette containing the aceB promoter was PCR amplified from plasmid pSLD5 
using forward primer 5' GTGCGGATCC TGGCTTTCCA ACGTTT 3' (SEQ ID 
NO:31, BamHI site underlined) and reverse primer 5' CATGGG CTCGAG ATGA 
CCTGTGCCTA 3' (SEQ ID NO:32, SacI site underlined). The resulting PCR 
product was digested with BamHI and SacI and ligated into the polylinker region 
of cloning vector pD11 to produce pD11aceB. The resulting plasmid was 
screened for orientation, and then the aceB promoter cassette was released from 
the vector by digestion with KpnI and XbaI and cloned into the KpnI and XbaI 
sites of plasmid pZap2 as described in (a). Transformed E. coli colonies were 
plated on LBXIA medium as in (a), blue colonies were selected, and plasmid 
isolates were verified by restriction digestion and electrophoresis. 

(d) Construction of 131aceB/ask 

A plasmid construct designed to integrate into the C. glutamicum 
chromosome, which contains the C. glutamicum aspartokinase (ask) coding 
region operably associated with the C. glutamicum aceB regulatory region, was 
constructed as follows. Plasmid pBGS131 (described in Spratt, B.G. et al. Gene 
47:337-342 (1986)) was digested with SmaI. The aceB cassette was released 
from plasmid pD11aceB, produced as in (c), by digestion with appropriate 
restriction enzymes and the cassette was subjected to a fill-in reaction to produce 
blunt ends. The cassette was then ligated into the digested pBGS131. Plasmid 
isolates were tested for the proper orientation of the aceB promoter by restriction 
digestion and electrophoresis, resulting in plasmid 131aceB. The C. glutamicum 
ask coding region was released from pTrc99Aask (produced as in (f), infra) by 
digestion with SacI and KpnI and cloned into the SacI and KpnI sites of 131aceB, 
to produce 131aceBask. 

(e) Construction of PD10aceBask 

A plasmid construct designed to replicate as a plasmid in C. glutamicum, 
which contains the C. glutamicum aspartokinase (ask) coding region operably 
associated with the C. glutamicum aceB regulatory region, was constructed as 
follows. A cassette containing the ask coding region operably associated with the 
aceB regulatory region was released from 131aceBask, produced as in (d), by 
digestion with BamHI and KpnI. The resulting fragment was cloned into the 
BamHI and KpnI sites of plasmid pD10, which is capable of replicating in C. 
glutamicum, to produce PD10aceBask. 

(f) Construction of 131LacIq/trc-ask 

A plasmid construct designed to integrate into the C. glutamicum 
chromosome, which contains the C. glutamicum aspartokinase (ask) coding 
region operably associated with the lacIq/trc regulatory region, was constructed 
as follows. The PCR-amplified ask coding region was digested with SacI and 
KpnI, and ligated into plasmid pTrc99A which had been digested with SacI and 
KpnI to produce pTrc99Aask. A cassette containing the ask coding region 
operably associated with the lacIq/trc regulatory region was then released from 
pTrc99Aask by digestion with SphI, and was cloned into the SphI site of plasmid 
pBGS131 to produce 131LacIq/trc-ask. 

(g) Construction of Additional LacZ Plasmid Expression Constructs 

Plasmid constructs designed to replicate as a plasmid in Corynebacterium, 
which contain the lacZ coding region operably associated with C. glutamicum 
regulatory regions contained in SEQ ID NOs 1, 2, and 4-22, are constructed as 
follows. Cassettes containing the regulatory regions present in SEQ ID NOs 1, 
2, and 4-22 are PCR amplified from the C. glutamicum chromosome using 
primers engineered to introduce appropriate restriction enzyme recognition sites 
onto the ends of the cassettes. In designing the primers, care is taken to introduce 
sites for restriction enzymes which will not digest the cassette internally. The 
resulting PCR products are digested with the chosen restriction enzymes, and the 
fragments are ligated into a polylinker region of a standard cloning vector. The 
resulting plasmids are screened for orientation, and then the cassettes are released 
from the vector by digestion with appropriate restriction enzymes, filled in as 
necessary, and cloned into the polylinker region upstream of the lacZ gene in 
plasmid pZap2 as described in (a). Transformed E. coli colonies are plated on 
LBXIA medium as in (a), blue colonies are selected, and plasmid isolates are 
verified by restriction digestion and electrophoresis. 
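
The primer design rule in (g), namely choosing enzymes that do not cut the cassette internally, can be checked with a simple string search. The following is a minimal sketch, not the procedure used in the specification. The cassette sequence is made up for illustration, and the helper names are hypothetical. The recognition sequences are the standard ones for the four enzymes listed, and the forward primer is SEQ ID NO:31 from (c).

```python
# Standard recognition sequences (5'->3') for enzymes named in this Example.
# All four sites are palindromic, so scanning one strand is sufficient.
SITES = {
    "BamHI": "GGATCC",
    "KpnI": "GGTACC",
    "XbaI": "TCTAGA",
    "SacI": "GAGCTC",
}


def find_sites(seq: str, site: str) -> list:
    """Return 0-based start positions of every occurrence of site in seq."""
    seq = seq.upper().replace(" ", "")
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def enzymes_safe_for(cassette: str) -> list:
    """Enzymes whose recognition site does not occur inside the cassette."""
    return [name for name, site in SITES.items() if not find_sites(cassette, site)]


FORWARD_PRIMER = "GTGCGGATCC TGGCTTTCCA ACGTTT"  # SEQ ID NO:31, from (c)
# Hypothetical cassette, invented for illustration; not a sequence from this document.
HYPOTHETICAL_CASSETTE = "ATGCCGTTAGGTACCTTGACAATTCGCGTATAATGCGC"

if __name__ == "__main__":
    print("BamHI site in forward primer at:", find_sites(FORWARD_PRIMER, SITES["BamHI"]))
    print("Enzymes that do not cut the cassette:", enzymes_safe_for(HYPOTHETICAL_CASSETTE))
```

In this illustration the made-up cassette contains an internal KpnI site, so KpnI would be excluded when choosing sites for the primer ends.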

(h) Construction of Additional Aspartokinase Expression Constructs 
Designed to Integrate 

Plasmid constructs designed to integrate into the C. glutamicum 
chromosome, which contain the C. glutamicum aspartokinase (ask) coding region 
operably associated with C. glutamicum regulatory regions contained in SEQ ID 
NOs 1, 2, and 4-22, are constructed as follows. Plasmid pBGS131 is digested 
with SmaI, or other appropriate restriction enzyme. The regulatory cassettes 
prepared as in (g) are released by digestion with appropriate restriction enzymes 
and the cassettes are subjected to fill-in reactions to produce blunt ends as needed. 
The cassettes are then ligated into the digested pBGS131. Plasmid isolates are 
tested for the proper orientation of the regulatory regions by restriction digestion 
and electrophoresis, to produce plasmids 131SEQ ID NO:1, 131SEQ ID NO:2, 
and 131SEQ ID NO:4 through 131SEQ ID NO:22. The C. glutamicum ask coding region 
is released from pTrc99Aask (produced as in (f)) by digestion with SacI and KpnI 
(or other appropriate restriction enzymes), or is PCR amplified from a C. 
glutamicum chromosome, and is cloned into the corresponding sites of plasmids 
131SEQ ID NO:1, 131SEQ ID NO:2, and 131SEQ ID NO:4 through 131SEQ ID NO:22, 
to produce the plasmids 131SEQ ID NO:1-ask, 131SEQ ID NO:2-ask, and 
131SEQ ID NO:4-ask through 131SEQ ID NO:22-ask. 

(i) Construction of Additional Aspartokinase Constructs Designed 
to Replicate as a Plasmid 

Plasmid constructs designed to replicate as a plasmid in C. glutamicum, 
which contain the C. glutamicum aspartokinase (ask) coding region operably 
associated with C. glutamicum regulatory regions contained in SEQ ID NOs 1, 2, 
and 4-22, are constructed as follows. Cassettes containing the ask coding region 
operably associated with the regulatory regions contained in SEQ ID NO:1, SEQ 
ID NO:2, and SEQ ID NO:4 through SEQ ID NO:22 are released from plasmids 
131SEQ ID NO:1-ask, 131SEQ ID NO:2-ask, and 131SEQ ID NO:4-ask through 
131SEQ ID NO:22-ask, produced as in (h), by digestion with appropriate restriction 
enzymes. The resulting fragments are cloned into corresponding sites of plasmid 
pD10, which is capable of replicating in C. glutamicum, to produce plasmids 
PD10/SEQ ID NO:1-ask, PD10/SEQ ID NO:2-ask, and PD10/SEQ ID NO:4-ask 
through PD10/SEQ ID NO:22-ask. 

Example 3 

Use of LacZ as a Reporter to Demonstrate Transcriptional Regulation by 
araBAD (SEQ ID NO:23) and the lacIq/trc Transcriptional Complex (SEQ 
ID NO:30) in Corynebacterium (41-004002) 

The ability of the heterologous transcriptional regulatory regions lacIq/trc 
and araBAD (with the araC repressor) to control gene expression in C. 
glutamicum was tested by measuring expression of β-galactosidase. Plasmids 
pTrcLacZ and pAraBADLacZ, produced as in Examples 2(a) and 2(b), 
respectively, were transferred into C. glutamicum according to the methods 
described in Example 1. The lacZ reporter gene, regulated by either of the two 
regulatory regions, was expressed from a plasmid replicating in C. glutamicum 
grown in the presence of an inducer: IPTG in the case of the lacIq/trc regulatory 
region, or arabinose in the case of the araC/araBAD regulatory region. The level 
of β-galactosidase production was measured, and the results are shown in Table 3. 



Table 3 

Regulator/promoter    Inducer                nmol/min/mg 
                                             Mean      S.D. 
none                  none                    0.40     0.14 
LacIq/trc             none                    3.84     0.57 
LacIq/trc             +IPTG (30 mg/L)        77.23     0.32 
AraC/AraBAD           none                    0.47     0.49 
AraC/AraBAD           +arabinose (1 g/L)      5.51     0.46 



The trc-regulated construct showed uninduced activity. The araBAD-regulated 
construct showed no uninduced activity and responded to arabinose. The 
araBAD promoter resulted in a β-galactosidase expression level that was 14-fold 
lower than that observed with the trc promoter in this organism. 
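
The comparisons drawn from Table 3 can be reproduced with a few ratios. The sketch below uses the mean activities from Table 3; the function and variable names are illustrative only.

```python
# Mean beta-galactosidase activities from Table 3 (nmol/min/mg).
TABLE_3_MEANS = {
    ("LacIq/trc", "uninduced"): 3.84,
    ("LacIq/trc", "induced"): 77.23,
    ("AraC/AraBAD", "uninduced"): 0.47,
    ("AraC/AraBAD", "induced"): 5.51,
}


def fold(numerator: float, denominator: float) -> float:
    """Simple ratio of two activities."""
    return numerator / denominator


# Fold induction of each regulatory region by its own inducer.
trc_induction = fold(TABLE_3_MEANS[("LacIq/trc", "induced")],
                     TABLE_3_MEANS[("LacIq/trc", "uninduced")])
ara_induction = fold(TABLE_3_MEANS[("AraC/AraBAD", "induced")],
                     TABLE_3_MEANS[("AraC/AraBAD", "uninduced")])

# Induced trc activity relative to induced araBAD activity.
trc_over_ara = fold(TABLE_3_MEANS[("LacIq/trc", "induced")],
                    TABLE_3_MEANS[("AraC/AraBAD", "induced")])

if __name__ == "__main__":
    print(f"trc fold induction:    {trc_induction:.1f}")
    print(f"araBAD fold induction: {ara_induction:.1f}")
    print(f"induced trc / araBAD:  {trc_over_ara:.1f}")
```

The last ratio corresponds to the 14-fold difference stated above. The first two show that IPTG raises trc-driven activity about 20-fold, while arabinose raises araBAD-driven activity about 12-fold.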

Example 4 

Use of β-Galactosidase Activity to Demonstrate the Response of the C. 
glutamicum aceB Promoter to Acetate, Glucose and Ethanol (98-0040002) 

The ability of the C. glutamicum aceB regulatory region (SEQ ID NO:3) 
to control gene expression of a heterologous reporter gene in C. glutamicum was 
tested by measuring expression of β-galactosidase from plasmid pAceB10Zap2, 
produced as in Example 2(c). In this construct, the lacZ reporter gene, regulated 
by aceB, is expressed from a plasmid replicating in C. glutamicum. Response of 
the regulatory region to various inducers was tested. The results are shown in 
Table 4. 



Table 4 

Rich Medium                                nmol/min/mg 
Minus sucrose                              10 
+ 2% glucose                               3 
+ 0.2% ammonium acetate                    26 
+ 2% ammonium acetate                      50 
+ 2% glucose + 0.2% ammonium acetate       7 
+ 2% glucose + 2% ammonium acetate         15 
+ 2% ethanol                               28 



Glucose repressed reporter activity. Ethanol (2%) induced activity, but not as 
strongly as 2% ammonium acetate. The uninduced control also showed activity, 
implying that aceB is self-regulated in this medium. 

Example 5 

Use of the trc Promoter to Regulate Aspartokinase Activity when Integrated 
into the Chromosome of C. glutamicum (90-004002) 

The ability of the lacIq/trc (SEQ ID NO:30) regulatory region to control 
expression of aspartokinase, an enzyme in the C. glutamicum L-lysine 
biosynthesis pathway (encoded by the ask gene), was tested as follows. The 131-2 
and 131-5 C. glutamicum strains have the lacIq/trc transcriptional regulatory 
region, in operable association with the ask gene, integrated into the C. 
glutamicum chromosome. These strains were prepared by introducing plasmid 
131LacIq/trc-ask into C. glutamicum by the methods described in Example 1. 
The level of aspartokinase activity was measured both with and without the 
addition of the inducer IPTG. The results are shown in Table 5. 



Table 5 

Strain    Regulator/promoter-gene    Inducer             nmol/min/mg 
131-2     lacIq/trc-ask              none                 59 
131-2     lacIq/trc-ask              +IPTG (30 mg/L)     117 
131-5     lacIq/trc-ask              none                 59 
131-5     lacIq/trc-ask              +IPTG (30 mg/L)     123 



The level of aspartokinase activity approximately doubled upon induction when 
the trc regulon is integrated into the chromosome. 

Example 6 

Lysine Production by Integrated, Uninduced lacIq/trc-ask Constructs 
(84-004002) 

This example shows the influence on lysine production of the uninduced 
lacIq/trc-ask construct when integrated into the chromosome of C. glutamicum. 
The lacIq/trc regulatory region was used to control expression of aspartokinase 
(encoded by the ask gene), an enzyme in the C. glutamicum L-lysine biosynthesis 
pathway. Bacterial cultures of 131-2 and 131-5, prepared as described in 
Example 5, were grown in shaker flasks. BF100 is a C. glutamicum strain which 
is a high level lysine producer. The level of aspartokinase activity was measured 
without the addition of an inducer. The results are shown in Table 6. 



Table 6 

Strain    Induction    O.D.    Titre    Yield    S.P. 
BF100     none         43      23       39       53 
131-2     none         34      27       46       82 
131-5     none         35      28       47       82 

O.D. = optical density at 660 nm 
Titre = grams lysine/liter 
Yield = grams lysine made/grams dextrose consumed 
S.P. = grams lysine/O.D. 

The leakiness of the trc promoter in this organism may be responsible for 
the improved productivity in the absence of an inducer. 



Example 7 

Improvement of Lysine Production in Shake Flasks by aceB-Aspartokinase 
Constructs with No Exogenous Induction (84-0040002) 



This example shows the influence on lysine production of an 
aceB-aspartokinase construct when integrated into the C. glutamicum chromosome. 
The C. glutamicum aceB regulatory region (SEQ ID NO:3) was used to control 
expression of aspartokinase (encoded by the ask gene), an enzyme in the C. 
glutamicum L-lysine biosynthesis pathway. Bacterial cultures were grown in 
shake flasks. Strain 131-6 is a C. glutamicum strain which has the aceB 
transcriptional regulatory region in operable association with the ask gene, 
integrated into the bacterial chromosome. Strain 131-6 was prepared by 
introducing plasmid 131aceB/ask, produced as described in Example 2(d), into 
C. glutamicum by the methods described in Example 1. BF100 is a C. glutamicum 
strain which is a high level lysine producer. The level of aspartokinase activity 
was measured either with or without the addition of the inducer acetate. The 
results are shown in Table 7. 



Table 7 

Strain    Induction    O.D.    Titre    Yield 
BF100     none         43      25       37 
          acetate      42      24       36 
131-6     none         50      27       41 
          acetate      47      27       40 

Titre = g lysine/L medium 
Yield = g lysine/g glucose consumed * 100% 



Induction by ammonium acetate is not required to observe an improvement in 
lysine production by this construct. 

Example 8 

Improvement of Lysine Production in Fermentors by aceB-aspartokinase 
Constructs with No Exogenous Induction (Lys002 Lys006RLK) 

This example shows the influence on lysine production of uninduced 
aceB-aspartokinase constructs which are expressed from a plasmid 
(PD10aceBask) replicating in C. glutamicum, or which are integrated into the 
C. glutamicum chromosome as in strain 131-6, described in Example 7. The 
C. glutamicum aceB regulatory region (SEQ ID NO:3) was used to control 
expression of aspartokinase (encoded by the ask gene), an enzyme in the 
C. glutamicum L-lysine biosynthesis pathway. Bacterial cultures were grown in 
fermentors. The C. glutamicum 131-aceB-aspartokinase strain, 131-6, has the 
aceB transcriptional regulatory region in operable association with the ask gene, 
integrated into the C. glutamicum chromosome. The C. glutamicum 
PD10-aceB-aspartokinase strain has the aceB transcriptional regulatory region in 
operable association with the ask gene, expressed from a plasmid (PD10aceBask, 
produced as described in Example 2(e)) replicating in C. glutamicum. BF100 is a 
C. glutamicum strain which is a high level lysine producer. Aspartokinase activity 
was measured in each of these three strains. The results are shown in Table 8. 



Table 8 

Strain                       Titer    Yield    Product 
BF100                        127      32.9     735 
131-aceB-aspartokinase       129      39.8     833 
BF100                        138      40.7     769 
PD10-aceB-aspartokinase      135      45.8     830 

Titer = g lysine/L medium 
Yield = g lysine/g glucose consumed * 100% 
Product = g lysine made in that fermentor 

The aceB-aspartokinase constructs contributed to productivity whether they were 
maintained on a plasmid or integrated into the chromosome. In neither case did 
the constructs need to be induced by an exogenous compound. 

Example 9 

Influence of Medium on Beta-Galactosidase Activity: Use of LacZ as a 
Reporter to Demonstrate Transcriptional Regulation by the 
Regulatory Regions Contained in SEQ ID NOs 1, 2, and 4-22 in 
Corynebacterium 

The ability of certain transcriptional regulatory regions disclosed herein 
to control gene expression in C. glutamicum was tested by measuring expression 
of β-galactosidase. The promoters used include (SEQ ID NO: 27) Tac, (SEQ ID 
NO: 1) Pta, (SEQ ID NO: 2) AceA, (SEQ ID NO: 3) AceB, (SEQ ID NO: 4) Adh, 
(SEQ ID NO: 5) AldB, (SEQ ID NO: 6) PoxB, (SEQ ID NO: 7) Ldh, (SEQ ID 
NO: 9) MalZ, (SEQ ID NO: 10) BglX, (SEQ ID NO: 13) HisD, (SEQ ID NO: 14) 
PyrR, (SEQ ID NO: 15) PurD, (SEQ ID NO: 18) DnaK, (SEQ ID NO: 20) GrpE, 
(SEQ ID NO: 21) ClpB, (SEQ ID NO: 33) LeuA, (SEQ ID NO: 34) IlvA and 
(SEQ ID NO: 35) IlvB. The reporter gene was expressed from plasmids 
replicating in C. glutamicum, and was driven by the regulatory regions contained 
in SEQ ID NOs 1, 2, and 4-22. The plasmids were prepared as described in 
Example 2(g). Increased expression of β-galactosidase under the 
transcriptional control of these transcriptional regulatory regions is shown in 
Table 9. The "Seed" Medium is experimentally determined to support high 
growth. The "Main" Medium is formulated to induce higher expression of 
products, such as lysine, for example. Exemplary media were described supra. 

For certain regulatory regions, β-galactosidase levels are measured 
without the addition of any inducer or in the presence of various inducers, e.g., 
acetic acid, pyruvate, ethanol, a starch subunit, a sugar, e.g., fructose, maltose, 
lactose or arabinose, a cellulose subunit, a fatty acid, or a triglyceride. In 
addition, certain regulatory regions are tested by growth with a suppressor, e.g., 
a purine, a pyrimidine, an amino acid, or oxygen, followed by a shift to medium 
lacking the suppressor at which time β-galactosidase levels are measured. For 
certain of the transcriptional regulatory regions, β-galactosidase levels are 
measured at optimal growth temperatures, and at temperatures either greater than 
or less than the optimal growth temperature. The β-galactosidase levels were 
measured using certain transcriptional regulatory regions at varying temperatures 
as shown in Table 10. 

Table 9 

Influence of Medium on beta-galactosidase Reporter Activity 
(units = nmol/min/mg protein) 

                                        Medium 
SEQ ID NO    Promoter        Seed (16 hr)    Main (48 hr)    Change 
             Promoterless      2              20              19 
27           Tac             231             342             111 
1            Pta             134             111             -23 
2            AceA              0              63              63 
3            AceB              0              25              25 
4            Adh               0              65              65 
5            AldB              9             260             251 
6            PoxB            150             276             126 
7            Ldh             205             291              86 
9            MalZ             62             234             172 
10           BglX             57             208             151 
13           HisD            119             241             122 
14           PyrR            161             289             128 
15           PurD              9              56              47 
18           DnaK             61             223             162 
20           GrpE             40             149             109 
21           ClpB            104             233             129 
33           LeuA            174             267             193 
34           IlvA            120             229             109 
35           IlvB             22             166             144 
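
The Change column of Table 9 can be cross-checked against the Seed and Main columns. The following sketch assumes that Change is defined as Main minus Seed, which the table does not state explicitly. The values are transcribed from Table 9 as printed, and the helper names are hypothetical.

```python
# Rows of Table 9: (promoter, seed_16hr, main_48hr, printed_change),
# all in nmol/min/mg protein, as printed in the table.
TABLE_9 = [
    ("Promoterless", 2, 20, 19), ("Tac", 231, 342, 111), ("Pta", 134, 111, -23),
    ("AceA", 0, 63, 63), ("AceB", 0, 25, 25), ("Adh", 0, 65, 65),
    ("AldB", 9, 260, 251), ("PoxB", 150, 276, 126), ("Ldh", 205, 291, 86),
    ("MalZ", 62, 234, 172), ("BglX", 57, 208, 151), ("HisD", 119, 241, 122),
    ("PyrR", 161, 289, 128), ("PurD", 9, 56, 47), ("DnaK", 61, 223, 162),
    ("GrpE", 40, 149, 109), ("ClpB", 104, 233, 129), ("LeuA", 174, 267, 193),
    ("IlvA", 120, 229, 109), ("IlvB", 22, 166, 144),
]


def mismatches(rows):
    """Rows whose printed Change differs from Main minus Seed (assumed definition)."""
    return [(name, printed, main - seed)
            for name, seed, main, printed in rows
            if main - seed != printed]


def largest_increase(rows):
    """Promoter with the largest recomputed Main minus Seed difference."""
    return max(rows, key=lambda row: row[2] - row[1])[0]


if __name__ == "__main__":
    for name, printed, recomputed in mismatches(TABLE_9):
        print(f"{name}: printed {printed}, recomputed {recomputed}")
    print("largest increase:", largest_increase(TABLE_9))
```

Under this assumed definition, all rows agree except Promoterless and LeuA. The differences might reflect rounding of underlying values or a transcription error; the sketch does not attempt to decide which.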



Table 10 

Effect of Temperature Change on beta-galactosidase Reporter Activity 
(units = nmol/min/mg protein; mean (standard deviation)) 
temperature shift in seed medium for 4 hrs 

SEQ ID NO    Promoter        30 °C        36 °C         40 °C 
             Promoterless    0 (0.7)      0 (0.2)       0 (0.7) 
21           clpB            76 (7.3)     123 (2.5)     165 (1.6) 
18           dnaK            70 (0.8)     119 (3.1)     169 (4.5) 
20           grpE            47 (1.6)     107 (1.6)     97 (2.6) 

All 3 promoters increased reporter activity at 36 °C. 
All 3 promoters increased reporter activity at 40 °C. 
The grpE promoter at least doubled the activity at both 36 °C and 40 °C. 
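
The statements under Table 10 can be checked by expressing each mean as a fold change over the 30 °C value. This is a minimal sketch using the means from Table 10; the standard deviations are ignored and the names are illustrative.

```python
# Mean reporter activities from Table 10 at (30, 36, 40) degrees Celsius.
TABLE_10_MEANS = {
    "clpB": (76, 123, 165),
    "dnaK": (70, 119, 169),
    "grpE": (47, 107, 97),
}


def fold_over_30c(means):
    """Return (fold at 36 C, fold at 40 C) relative to the 30 C mean."""
    base, at_36, at_40 = means
    return at_36 / base, at_40 / base


FOLDS = {name: fold_over_30c(means) for name, means in TABLE_10_MEANS.items()}

if __name__ == "__main__":
    for name, (f36, f40) in FOLDS.items():
        print(f"{name}: {f36:.2f}x at 36 C, {f40:.2f}x at 40 C")
```

All six ratios exceed 1, and both grpE ratios exceed 2, consistent with the three statements above. The grpE activity at 40 °C is lower than at 36 °C but still above the 30 °C value.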


Example 10 

Improvement of Lysine Production Through Transcriptional Regulation of 
Aspartokinase by Regulatory Regions Contained in SEQ ID NOs 1, 2, 
and 4-22 

This example shows the influence on lysine production of induced or 
uninduced constructs in which the transcriptional regulatory regions of SEQ ID 
NOs 1, 2, and 4-22 are used to express aspartokinase, either from a plasmid 
replicating in C. glutamicum, or when integrated into the C. glutamicum 
chromosome. The various C. glutamicum regulatory regions are used to control 
expression of aspartokinase, an enzyme in the C. glutamicum L-lysine 
biosynthesis pathway. Bacterial cultures are grown either in shake flasks or 
fermentors. C. glutamicum strains are prepared in which the transcriptional 
regulatory regions of SEQ ID NOs 1, 2, and 4-22, in operable association with the 
ask gene, are integrated into the C. glutamicum chromosome. These strains are 
prepared by introducing plasmids 131SEQ ID NO:1-ask, 131SEQ ID NO:2-ask, 
and 131SEQ ID NO:4-ask through 131SEQ ID NO:22-ask, prepared as described 
in Example 2(h), into C. glutamicum by methods described in Example 1. 
Additional strains, with self-replicating plasmids, are prepared by introducing 
plasmids PD10/SEQ ID NO:1-ask, PD10/SEQ ID NO:2-ask, and PD10/SEQ ID 
NO:4-ask through PD10/SEQ ID NO:22-ask, produced as described in Example 
2(i), into C. glutamicum by methods described in Example 1. BF100, a high level 
lysine producer, is used as a control. Aspartokinase activity is measured in each 
of the resulting strains. For certain regulatory regions, aspartokinase levels are 
measured without the addition of any inducer, or in the presence of various 
inducers, e.g., acetic acid, pyruvate, ethanol, a starch subunit, a sugar, e.g., 
fructose, maltose, lactose or arabinose, a cellulose subunit, a fatty acid, or a 
triglyceride. In addition, certain regulatory regions are tested by growth either 
without, or with a suppressor, e.g., a purine, a pyrimidine, an amino acid, or 
oxygen, followed by a shift to medium lacking the suppressor at which time 
aspartokinase levels are measured. Finally, for certain regulatory regions, 
aspartokinase levels are measured at optimal growth temperatures, and also at 
temperatures either greater than or less than the optimal growth temperature. 

* * * * * 

Having now fully described the present invention in some detail by way 
of illustration and example for purposes of clarity of understanding, it will be 
obvious to one of ordinary skill in the art that the same can be performed by 
modifying or changing the invention with a wide and equivalent range of 
conditions, formulations and other parameters thereof, and that such 
modifications or changes are intended to be encompassed within the scope of the 
appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same 
extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference. 

What Is Claimed Is: 

1. An isolated polynucleotide comprising a nucleic acid at least 90% 
identical to a reference nucleotide sequence selected from the group consisting 
of: 

(a) SEQ ID NO: 4, 
(b) SEQ ID NO: 5, 
(c) SEQ ID NO: 6, 
(d) SEQ ID NO: 7, 
(e) SEQ ID NO: 8, 
(f) SEQ ID NO: 9, 
(g) SEQ ID NO: 10, 
(h) SEQ ID NO: 11, 
(i) SEQ ID NO: 12, 
(j) SEQ ID NO: 13, 
(k) SEQ ID NO: 14, 
(l) SEQ ID NO: 15, 
(m) SEQ ID NO: 16, 
(n) SEQ ID NO: 17, 
(o) SEQ ID NO: 18, 
(p) SEQ ID NO: 19, 
(q) SEQ ID NO: 20, 
(r) SEQ ID NO: 21, and 
(s) SEQ ID NO: 22. 

2. The polynucleotide of claim 1, wherein said polynucleotide 
regulates transcription of β-galactosidase in a bacterial host cell. 

3. The polynucleotide of claim 1, wherein the sequence of said 
nucleic acid is identical to said reference nucleotide sequence. 

4. The polynucleotide of claim 1, further comprising a second nucleic 
acid. 

5. The polynucleotide of claim 4, wherein said second nucleic acid 
encodes a polypeptide. 

6. The polynucleotide of claim 5, wherein said polypeptide is 
selected from the group consisting of: 

(a) a polypeptide which is a component of an amino acid 
biosynthesis pathway; 

(b) a polypeptide which is a component of a purine nucleotide 
biosynthesis pathway; and 

(c) a heterologous polypeptide. 

7. The polynucleotide of claim 6, wherein said polypeptide is a 
component of an amino acid biosynthesis pathway. 

8. The polynucleotide of claim 7, wherein said amino acid 
biosynthesis pathway is a lysine biosynthesis pathway. 

9. The polynucleotide of claim 7, wherein said polypeptide is 
selected from the group consisting of: 

(a) aspartokinase, 

(b) diaminopimelate dehydrogenase, 

(c) diaminopimelate decarboxylase, 

(d) dihydrodipicolinate synthetase, 

(e) dihydrodipicolinate reductase, 

(f) aspartate beta-semialdehyde dehydrogenase, and 

(g) pyruvate carboxylase. 

10. A method of producing a vector which comprises inserting the 
polynucleotide of claim 1 into a vector. 

11. A vector comprising the polynucleotide of claim 1. 

12. A vector comprising the polynucleotide of claim 4. 

13. A vector comprising the polynucleotide of claim 6. 

14. A host cell comprising the vector of claim 11. 

15. The host cell of claim 14, wherein said host cell is a 
Corynebacterium species. 

16. A host cell comprising the vector of claim 12. 

17. A host cell comprising the vector of claim 13. 

18. A method of producing a transformed Corynebacterium species 
host cell comprising: 

(a) introducing into Corynebacterium species cells the vector 
of claim 17, and 

(b) selecting said host cell. 

19. A method of production of a biosynthetic product, comprising 
culturing the host cell of claim 18 in or on a culture medium, and recovering said 
product. 

20. An isolated polynucleotide comprising a nucleic acid which 
hybridizes to a reference nucleic acid, or the complement thereof, wherein the 
sequence of said reference nucleic acid is selected from the group consisting of: 

(a) SEQ ID NO: 4, 
(b) SEQ ID NO: 5, 
(c) SEQ ID NO: 6, 
(d) SEQ ID NO: 7, 
(e) SEQ ID NO: 8, 
(f) SEQ ID NO: 9, 
(g) SEQ ID NO: 10, 
(h) SEQ ID NO: 11, 
(i) SEQ ID NO: 12, 
(j) SEQ ID NO: 13, 
(k) SEQ ID NO: 14, 
(l) SEQ ID NO: 15, 
(m) SEQ ID NO: 16, 
(n) SEQ ID NO: 17, 
(o) SEQ ID NO: 18, 
(p) SEQ ID NO: 19, 
(q) SEQ ID NO: 20, 
(r) SEQ ID NO: 21, 
(s) SEQ ID NO: 22, 

wherein said first nucleic acid is at least 30 nucleotides in length. 

21. The polynucleotide of claim 20, wherein said polynucleotide 
regulates transcription of β-galactosidase in a bacterial host cell. 

22. An isolated polynucleotide comprising a nucleic acid, the sequence 
of which comprises 10 contiguous nucleotides of a reference sequence selected 
from the group consisting of: 

(a) SEQ ID NO: 4, 
(b) SEQ ID NO: 5, 
(c) SEQ ID NO: 6, 
(d) SEQ ID NO: 7, 
(e) SEQ ID NO: 8, 
(f) SEQ ID NO: 9, 
(g) SEQ ID NO: 10, 
(h) SEQ ID NO: 11, 
(i) SEQ ID NO: 12, 
(j) SEQ ID NO: 13, 
(k) SEQ ID NO: 14, 
(l) SEQ ID NO: 15, 
(m) SEQ ID NO: 16, 
(n) SEQ ID NO: 17, 
(o) SEQ ID NO: 18, 
(p) SEQ ID NO: 19, 
(q) SEQ ID NO: 20, 
(r) SEQ ID NO: 21, and 
(s) SEQ ID NO: 22. 



23. The polynucleotide of claim 22, wherein said polynucleotide 
regulates transcription of (3-galactosidase in a bacterial host cell. 

* 

24. The polynucleotide of claim 22, wherein the sequence of said first 
nucleic acid comprises 20 contiguous nucleotides of any of said sequences. 



25. The polynucleotide of claim 24, wherein the sequence of said first 
nucleic acid comprises 50 contiguous nucleotides of any of said sequences. 
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26. The polynucleotide of claim 25, wherein the sequence of said first 
nucleic acid comprises 150 contiguous nucleotides of any of said sequences. 

27. The polynucleotide of claim 24, further comprising a second 
nucleic acid. 

28. The polynucleotide of claim 27, wherein said second nucleic acid
encodes a polypeptide.

29. The polynucleotide of claim 28, wherein said polypeptide is 
selected from the group consisting of: 

(a) a polypeptide which is a component of an amino acid 
biosynthesis pathway; 

(b) a polypeptide which is a component of a purine nucleotide 
biosynthesis pathway; and 

(c) a heterologous polypeptide. 

30. The polynucleotide of claim 29, wherein said polypeptide is a 
component of an amino acid biosynthesis pathway. 

31. A method of producing a vector which comprises inserting the 
polynucleotide of claim 22 into a vector. 

32. A vector comprising the polynucleotide of claim 22. 

33. A vector comprising the polynucleotide of claim 27.

34. A host cell comprising the vector of claim 32. 



35. The host cell of claim 34, wherein said host cell is a
Corynebacterium species.

36. A host cell comprising the vector of claim 33. 

37. A method of producing a transformed Corynebacterium species
host cell comprising:

(a) introducing into Corynebacterium species cells the vector
of claim 33, and

(b) selecting said host cell.

38. A method of production of a biosynthetic product, comprising 
culturing the host cell of claim 36 in or on a culture medium, and recovering said 
product. 


39. An isolated Corynebacterium species chromosome, comprising:
a first nucleic acid integrated into said chromosome, the sequence
of which is at least 90% identical to SEQ ID NO:1, and

a second nucleic acid integrated into said chromosome, wherein
said second nucleic acid encodes a polypeptide which functions as a component
of an amino acid biosynthesis pathway; and

wherein said first nucleic acid regulates transcription of said
second nucleic acid.

40. The Corynebacterium species chromosome of claim wherein the
sequence of said first nucleic acid is identical to SEQ ID NO:1.

41. A host cell comprising the Corynebacterium species chromosome 
of claim 39. 



42. A method of producing the Corynebacterium species host cell of
claim 41, comprising:

(a) transforming Corynebacterium species cells with a vector
comprising a first nucleic acid, the sequence of which is at least 90% identical to
SEQ ID NO: 1, wherein said vector facilitates integration of said first nucleic acid
into the chromosome of said Corynebacterium species cells, and

(b) selecting said host cell. 

43. A vector comprising the first and second polynucleotide of 
claim 39. 

44. The vector of claim 43, wherein said second nucleic acid encodes 
a polypeptide which functions as a component of an amino acid biosynthesis 
pathway; and 

wherein said first nucleic acid regulates transcription of said second 
nucleic acid. 

45. A method of production of an amino acid, comprising culturing 
the host cell of claim 41 in or on a culture medium, and recovering said amino 
acid. 

46. An isolated polynucleotide comprising: 

a first nucleic acid, the sequence of which is at least 90% identical 
to SEQ ID NO: 2, and 

a second nucleic acid operably associated with said first nucleic 
acid, wherein said second nucleic acid encodes polypeptide which functions as 
a component of a lysine biosynthesis pathway; 

wherein said first nucleic acid regulates transcription of said 
second nucleic acid.

47. The isolated polynucleotide of claim 46, wherein the sequence of
said first nucleic acid is identical to SEQ ID NO:2.

48. A method of producing a vector, comprising inserting the
polynucleotide of claim 47 into a vector.

49. A vector comprising the polynucleotide of claim 46. 

50. A host cell comprising the vector of claim 49. 

51. A method of producing a transformed Corynebacterium species 
host cell comprising: 

(a) introducing into Corynebacterium species cells the vector 
of claim 49, and 

(b) selecting said host cell. 

52. An isolated polynucleotide comprising: 

a first nucleic acid, the sequence of which is at least 90% identical 
to SEQ ID NO: 3; and 

a second nucleic acid operably associated with said first nucleic
acid, wherein said second nucleic acid encodes polypeptide which functions as
a component of an amino acid biosynthesis pathway; 

wherein said first nucleic acid regulates transcription of said 
second nucleic acid. 

53. The isolated polynucleotide of claim 52 wherein the sequence of 
said first nucleic acid is identical to SEQ ID NO:3. 


54. A method of producing a vector, comprising inserting the 
polynucleotide of claim 52 into a vector.

55. A vector comprising the polynucleotide of claim 52.

56. A host cell comprising the vector of claim 55. 

57. A method of producing a transformed Corynebacterium species 
host cell comprising: 

(a) introducing into Corynebacterium species cells the vector 
of claim 55, and 

(b) selecting said host cell. 

58. A method of production of an amino acid, comprising culturing 
the host cell of claim 56 in or on a culture medium, and recovering said amino 
acid. 

59. An isolated Corynebacterium species chromosome, comprising:
a first nucleic acid integrated into said chromosome, the sequence
of which is at least 90% identical to SEQ ID NO:23, and

a second nucleic acid integrated into said chromosome in operable 
association with said first nucleic acid, wherein said second nucleic acid encodes 
polypeptide which functions as a component of an amino acid biosynthesis 
pathway; 

wherein said first nucleic acid regulates transcription of said 
second nucleic acid. 

60. The Corynebacterium species chromosome of claim 59, wherein 
the sequence of said first nucleic acid is identical to SEQ ID NO:23. 

61. A Corynebacterium species host cell comprising the
Corynebacterium species chromosome of claim 59.

62. A method of producing the Corynebacterium species host cell of
claim 61, comprising: 

(a) transforming Corynebacterium species cells with a vector 
comprising a first nucleic acid, the sequence of which is at least 90% identical to 
SEQ ID NO: 23, wherein said vector facilitates integration of said first nucleic 
acid into the chromosome of said Corynebacterium species cells, and 

(b) selecting said host cell. 

63. A vector comprising the Corynebacterium species chromosome 
of claim 59. 

64. A method of production of an amino acid, comprising culturing 
the host cell of claim 61 in or on a culture medium, and recovering said amino 
acid. 

65. The method of claim 64, further comprising adding arabinose to 
said culture medium. 

66. The method of claim 65, wherein said first polynucleotide 
facilitates enhanced production of said amino acid in the presence of arabinose. 

67. An isolated Corynebacterium species chromosome, comprising:
a first nucleic acid integrated into said chromosome, the sequence
of which is identical to a nucleotide sequence selected from the group consisting
of

(a) SEQ ID NO:26, and

(b) SEQ ID NO:27,

a second nucleic acid integrated into said chromosome in operable
association with said first nucleic acid, wherein said second nucleic acid encodes
polypeptide which functions as a component of an amino acid biosynthesis
pathway; and

a third nucleic acid, the sequence of which is at least 90% identical 
to SEQ ID NO:28; 

wherein said first nucleic acid regulates transcription of said second 
nucleic acid, and wherein said third nucleic acid encodes a lac repressor 
polypeptide, wherein said third nucleic acid is in operable association with said 
first nucleic acid. 

68. The Corynebacterium species chromosome of claim 67, wherein 
the sequence of said third nucleic acid is identical to SEQ ID NO:28. 

69. The Corynebacterium species chromosome of claim 67, wherein 
said third nucleic acid is operably associated with a transcriptional regulatory 
region. 

70. A Corynebacterium species host cell comprising the 
Corynebacterium species chromosome of claim 67. 

71. A method of producing the Corynebacterium species host cell of
claim 70, comprising:

(a) transforming Corynebacterium species cells with a vector
comprising:

(i) a first nucleic acid, the sequence of which is 
selected from the group consisting of SEQ ID NO:26 and SEQ ID NO:27; and 

(ii) a third nucleic acid, the sequence of which is at 
least 90% identical to SEQ ID NO:28, operably associated with a transcriptional 
regulatory region, wherein said third nucleic acid encodes a lac repressor 
polypeptide; and

wherein said vector facilitates integration of said first and third
nucleic acids into the chromosome of said Corynebacterium species cells, and

(b) selecting said host cell. 

72. A vector comprising the first, second and third nucleic acid of 
claim 67. 

73. A method of production of an amino acid, comprising culturing 
the host cell of claim 70 in or on a culture medium, and recovering said amino 
acid. 
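
Claims 22 and 24 to 26 above recite nucleic acids comprising 10, 20, 50 or 150 contiguous nucleotides of a reference sequence. The following minimal sketch illustrates one way such a limitation could be checked computationally. It is an illustration only and not part of the claimed subject matter: the helper names `clean` and `shares_contiguous_run` and the `candidate` construct are hypothetical, only exact matches on the strand as printed are considered, and the reference fragment is the first 60 nucleotides of SEQ ID NO: 10 as given in the sequence listing.

```python
def clean(seq: str) -> str:
    """Lower-case a sequence and keep only the letters a, c, g and t."""
    return "".join(ch for ch in seq.lower() if ch in "acgt")


def shares_contiguous_run(query: str, reference: str, n: int) -> bool:
    """Return True if some run of n consecutive nucleotides of `reference`
    also occurs in `query` (exact match, same strand only)."""
    query, reference = clean(query), clean(reference)
    if n <= 0 or len(reference) < n or len(query) < n:
        return False
    # Collect every n-nucleotide window of the reference, then scan the query.
    windows = {reference[i:i + n] for i in range(len(reference) - n + 1)}
    return any(query[i:i + n] in windows for i in range(len(query) - n + 1))


# First 60 nucleotides of SEQ ID NO: 10, copied from the sequence listing.
SEQ10_PREFIX = (
    "taatacggca tccggcctga aaggacagtg tggaacccaa aaatcatgcc ctcgcagaat"
)

# Hypothetical construct: 25 reference nucleotides with unrelated flanking bases.
candidate = "gg" + clean(SEQ10_PREFIX)[5:30] + "tt"

print(shares_contiguous_run(candidate, SEQ10_PREFIX, 20))
```

A percent-identity limitation, such as "at least 90% identical", would additionally require a sequence alignment step, which this sketch does not attempt.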
SEQUENCE LISTING

<110> Rayapati, P. John 
Crafton, Corey M. 

<120> Nucleotide Sequences for Transcriptional Regulation in 
Corynebacterium glutamicum 

<130> 1533.194PC02 

<140> 
<141> 

<150> 60/248,219 

<151> 2000-11-15 

<160> 35 

<170> PatentIn version 3.0

<210> 1 
<211> 500 
<212> DNA 

<213> Corynebacterium glutamicum 
<400> 1 

caccgacaac ggcaacacgc aaagggcgag acatataaag ttcgattcct taaaggggtt 60 
ctaaaaaatg tggagtatgt gagcggggtt ccactagtag attcgactcc tatcggggtg 120 
cgactgctaa tggtgccctg ctatcaaccc tccatgatac gtggtaagtg cagactaata 180 
aaggccagtc ggggaggatt gggggctttg ctgggggcag atttgtcacg ctgcgcgctt 240 
tcatagaccc cattaatgtg gggtgaagag ctgtaaagta ccgctaaaaa ctttgcaaag 300 
ggtgcttcgc aacttgtaac cgctccgtat tgttttctac ggcaataagc atttgtgctg 360 
ctcaaagcgt ggaattgaga tcggtttgaa aattacaaaa taaaactttg caaaccgggc 420 
tgtacgcaag gcggacgaac gctaaactat gtaagaaatc acaacttccc ctcagtagtg 480 
ccaggaggca caagcctgaa 500

<210> 2
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 2
actcttttaa gaaaagcact ctgactacct ctggaatcta ggtgccactc ttctttcgat 60
ttcaaccctt atcgtgtttg gcgatgtgat cagactaagt gatcaccgtc accagcaaaa 120
ggggtttgcg aactttacta agtcattacc cccgcctaac cccgactttt atctaggtca 180
caccttcaaa acctacggaa cgttgcggtg cctgcatttt cccatttcag agcatttgcc 240
cagtacatct gtactagcaa ctcccccgcc cactttttct gcgaagccag aactttgcaa 300
acttcacaac aggggtgacc acccccgcac aaaacttaaa aacccaaacc gattgacgca 360
ccaatgcccg atggagcaat gtgtgaacca cgccaccacg caaaccgatg cacatcacgt 420
cgaaacagtg acagtgcatt agctcatact ttgtggttgg caccgcccat tgcgaatcag 480
cacttaagga agtgactttg 500



<210> 3 
<211> 500 
<212> DNA 

<213> Corynebacterium glutamicum 
<400> 3 

ttgcgtggtg gcgtggttca cacattgctc catcgggcat tggtgcgtca atcggtttgg 60 
gtttttaagt tttgtgcggg ggtggtcacc cctgttgtga agtttgcaaa gttctggctt 120 
cgcagaaaaa gtgggcgggg gagttgctag tacagatgta ctgggcaaat gctctgaaat 180 
gggaaaatgc aggcaccgca acgttccgta ggttttgaag gtgtgaccta gataaaagtc 240 
ggggttaggc gggggtaatg acttagtaaa gttcgcaaac cccttttgct ggtgacggtg 300
atcacttagt ctgatcacat cgccaaacac gataagggtt gaaatcgaaa gaagagtggc 360 
acctagattc cagaggtagt cagagtgctt ttcttaaaag agttttcaca accgttaacg 420 
gcgtagccaa acaagaagga ttcgcattct tctggtttag gcacaggtca tctaaaccca 480 
tgctttaaaa ggagccttca 500 



<210> 4
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 4
attggaaaat ggagattaga agaatcctgg gaatgttggt gtgtgttgca tgtctgttga 60
gactatctag tagatgcggc tgcgctcctt aattgcatgc tggggtggtg gggaatgggt 120
ggttgggggc gtcgaaaagc atttttggtg cttctaaggg aattgtgtga atcttggaaa 180
gctaattgaa aaacattccc attagtgggt gatttgctgg agttttgtga atctattttt 240
cgaaatttca gcgtgcgggg gtggtttgtt tttttacaat tgccagttca ttcacggttg 300
ttgaaatgtt cgggggtaat aactcaactt tctattttca ccttgttggg atttcgctag 360
ggtggacgat ggcagcaatt gaatgttgta aatcacacaa ttgcaaggat tgtaatttaa 420
ggcacatcta tgtcggtgtg aaattacatg tgccagaaga gcaatttgcc aagtaatcca 480
agcgagaagg agtgagtttt 500


<210> 5
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 5
cgatcatcga actcggcgaa gaaaaccgaa acctcaaaga atccctgcgt aaggtcacag 60
ctgagaatga gcagctcaaa gatcaattac gcagcgggcg tccgcgtggc gagctggtgc 120
acgtgccccg ctccaccgcg gtggtcatgt gggaacgccg caaggggcgt tccaagtaaa 180
aacatgcttg tcgacgccgc tttctagcaa attaagcggg cacctccatt tatcttttgg 240
aggtgcccgt ttcgtgcttt cgccaattag atacatgcat aaccacccga acaggggtaa 300
taacttttga aaggctttcg gcgttgagct gcgagaattt tgagaaaagg gggtgaattt 360
aacaggggtt ctagcgcgga ttgattttcg tgaatatggt ggctgctaag cgtgcgaatg 420
tgcgcgttat cacaatcgtt gaccaagtgt cacctgacgc acaggtagtg ctcaggtgga 480
ggtggcccaa aggagaccca 500



<210> 6
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 6
ttttagacca cggcgctgtg tggggattta agacgtcgga aattgtaggg gactgtcagt 60
gtgagtcggg ttctttgagg cgcttagagg cgattctgtg aggtcacttt ttgtggggtc 120
ggggtctaaa tttggccagt tttcgaggcg accagacagg cgtgacaaga ttgactaaaa 180
aaccgaagtt ttggcacgtg tgtttggttt ctcggggtct aaaccggaca ggcgtgacaa 240
gatctggcga aatcgcaggt ttttgtcacg cctgtctggt tttacctttt gggggcccga 300
actgccctga actactcgga tcgaccaagc agtttggcct ccagcgctct gatcaagcac 360
ccaaccgcct ctaaatcaca ccaaggcact cgtaaaaccc gtggcagata gagaaagtgt 420
ggcagcaact cgaattgaag agcacaattg aagtcgcacc aagttaggca acacaatagc 480
cataacgttg aggagttcag 500



<210> 7
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 7
aaaacagcca ggttagcggc tgtaacccac cacggtttcg gcaacaatga cggcgagaga 60
gcccaccaca ttgcgatttc cgctccgata aagccagcgc ccatatttgc agggaggatt 120
cgcctgcggt ttggcgacat tcggatcccc ggaaccagct ctgcaatgac ctgcgcgccg 180
agggaagcga ggtgggtggc aggttttagt gcgggtttaa gcgttgccag gcgagtggtg 240
agcaaagacg ctagtctggg gagcgaaacc atattgagtc atcttggcag agcatgcaca 300
attctgcagg gcatagattg gttttgctcg atttacaatg tgattttttc aacaaaaata 360
acacttggtc tgaccacatt ttcggacata atcgggcata attaaaggtg taacaaagga 420
atccgggcac aagctcttgc tgattttctg agctgctttg tgggttgtcc ggttagggaa 480
atcaggaagt gggatcgaaa 500



<210> 8
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 8
ccagatctcg ggatgcttcg tagatcagca tggcgccgag gtcgttggct gctgcacgga 60
aagctgcgtt gtcgctgcgc tcgtcgcgca acaggggtag gcggctagca acgagtgggt 120
ggttgacgat ggtgatgtcc atagcttcac atgttaaatc attgccgccc agaagaagac 180
cgcgcgggcg aatttgggct tggagggaac caaacggcca cttttccagt ccaacaaagt 240
atgaggatta atttgcccca ctccaaagag ctcgcccacg agctgtgttt gttgcccacc 300
cctgctgtgc ccgcgcttcc cactgattct ggcgcgcagt ttgatatcca ccaggcactg 360
tccgcctctc ttgccaccta tgcccgcaac ctcaccttgc tgtcccacac cgccgagaat 420
ttaggaaacc gcgctctgac cggcctcgct gaaatcgaag acaccgacga ccaactcgca 480
cacgcattgg agcgcctgac 500



<210> 9
<211> 1583
<212> DNA
<213> Corynebacterium glutamicum

<400> 9
gggcatcatg ggcattgttg gtggattcac catgcctgtc gcaatcgcta ggaccaggga 60
taagaacctc gtgtggttcc ccgtggtctt tggtgcatcg atgttcctcg gttatgtggg 120
aacgtggctg tggccgtccc aaggctggta cctgtggtca ttccttcttg gtttaggtgg 180
actctgcttc ccgatggcta tcgccctgat tccagcgcgt acgaaagatc cgagaattac 240
cgcaagcttg tctggatttg tgcagccggt gggttacatt cttgcagccc ttgggccatt 300
ggcagtggga gcgatctacc aggcgattgg ctcctggtca gagatcctcg ttggtttggc 360
cttgggcaca atagtgttgt cgattgtggg attcagagca gcacgcaatg tgacggttga 420
tgatgaattg aggagatcaa agtagcctca actaagcgtc gcgataagaa cgaggggcaa 480
ggctgatgta ctctgtcaac catggataaa ccggtcgtga gggatgcagc tctgctgatt 540
tttcgcgctg tgctcggagt gatctttgtg gcacacgggt gggaaaagct gttcatctcc 600
ggagttacca agacaacagg acaattttca gcctggggag tgcctcaacc caagctctcg 660
gtgtggatca catcgatctc tgagctgctc ggtggtgcct tcctagtggt tggtttgctc 720
accacctttg ttgctggtgc attagcgctg ttgatcgccg ctgctattta ctttgtgcac 780
ttgagttcgg gctttttcac agttgataac ggcatcgaat tccccttgct catcattgtt 840
tctttgctcg tgatcgttgt gtttggttct ggtagagcca gcgttgatgg ggtgctcacg 900
cgtggttgac tgtagtgcca ttcaagccgc gctgtccgcc aaattagatg gtgagccgac 960
aggcttggat gatgcagtaa ttgaagcgca cctcgctaat tgtgaagagt gcagaaatta 1020
ctacaaccgt gctgctgagt tgaatcggat gctcaatttt tgcgccgcgg aacctcgcac 1080
cctgaccccg cctgatctat cagagatcat tctggcagag gtggaaccag aatggcgcag 1140
gcatgccaac gccaaggttg tgggatccct gctatcgcga gtgttgttgg tgatcctggg 1200
tgtggtttac ctcgcctggg gtatcacaat gttgggggat tcggcgtcga taagcgtcca 1260
agaagacccg ctcacctcgc gcctgctcgc ggaggccgtt gcctaccgca ttgctctttc 1320
tgtggggctg ttatttgcgg cgtggaagcc gcggattatc gcgggcatgc tcccgatttt 1380
tggaacgttg tggacattta gtgctggttt tgctgcgcgc gatctcgtgt ttggcgtcgc 1440
cgattcacag acgggactgt ccattggtct gctgttgatt tctacgattg tgctgtcgtt 1500
tgcgttggtg aatagttctg gaccgggtat tttgcggcgc acatggaact cattgaacgc 1560
cgcgcccggc taaggtggga ggc 1583



<210> 10 

<211> 500 

<212> DNA 

<213> Corynebacterium glutamicum



<400> 10 

taatacggca tccggcctga aaggacagtg tggaacccaa aaatcatgcc ctcgcagaat 60 

cgtgtttaag gggttaaaac gcctcgaccc acactctgac ccatccgtga aactagacct 120 

cttaaaacga cttctggttg gtgaggtgtg aaaaccgcac tgcggggccc aattcacaaa 180 

aatccatacg agtgtgcaca cccgattttc acatcgcttc gagacctccc tttttgacac 240 

cttaattgtc taaccccgta taggtgagaa atgttggaca agtgtctgtt tttgtggggg 300 

gaatctgact acgatggtaa gaaacaggga aaggggttac cattatgtct caagagtagc 360 

ctcaaatcgg ctcccgcctc tctcgtgtca ttgaacaaga cggcctacaa ttccgcgatc 420

tcgacggcga cggcgtactt gcaccttatg aagattggcg tctaacccca gcagagcgtg 480

ccgctgacct ggtgaaacga 500 

<210> 11 
<211> 500 
<212> DNA
<213> Corynebacterium glutamicum
<400> 11 

caccgcgtcg ccgatctctc ccttccagac tcagaagttg cttcggcaat tgcacttgag 60 

tggaaaccag gcgtcggatb cactacttga gctgctgatt tgtaggtttt aagaccttga 120 

agatatagtt aattctcgtt gcaaggaacc agattccaag caatgcgtat tcctccatag 180 

ctcagttggc agagcattcg actgttaatc gaagggtcac tggttcgagc ccagttggag 240 

gagcaaattg aaacccactg ttttttaaca gtgggttttt ttgcatgttt catacagtta 300 

acgaacccca attttgtaac ttccactctc ctagctatga tgaatactcg ttgcaaggaa 360 

gtaattcctt tccaatactt attcctccat agctcagttg gcagagcatt cgactgttaa 420 

tcgaagggtc actggttcga gcccagttgg aggagcaata cacacagccc gccgtttttc 480 

ttaaaacggc gggttttgtt 500 

<210> 12
<211> 1083
<212> DNA
<213> Corynebacterium glutamicum

<400> 12
ggtgtcagtg tatccgcgaa ccagagcctc gatggcagct gtgtcttctg gggaagcggt 60
ctggccgttt tcaactggcg cagcttcagc gaaggtaatt gggttctcag acagctcttg 120
tgacagtaca tcaagctgtg cttgctgctc ttcgttgatg gttgttgctt cggaagattc 180
agattcagat tcagattcgg acgaggtggt ttctgctgct tccgccgagg tcgttgcgct 240
ggaagaggaa gaatttgtcg tcgtcgctgc gctggatgag gctgcttctg tatcagatga 300
ctcactgcta catgcggtta agagaagtgg ggtgaccatg agtgctgcga aagcagcctt 360
ctttgaggaa aggcgaatag acaaagttct gctcctgata aatcatcgac atgctccgga 420
aaacttaaaa attcccggac ggttcacgca gattacccta gcaaagcaat ctagctgacg 480
acccaattta gtcctgtcat tatgctggca attgtgcagc tatctaaaga atctattatt 540
ggggcagccg tttcgatcct gagcgagttc ggtttgtcgg atatgaccat gcgtcgcgtc 600
gcaaagcaat taaatgtcgc gccgggcgcg ctgtattggc attttaaaaa taagcaggag 660
cttatcgacg ccacctcacg ccatctcctg gcgcctatct tggggcgcaa cgacgagcag 720
cgagcaagca tttccgcgca ggaaacgtgc gcagaaatgc gttcactgat gatgcaaacc 780
aaagacggtg cggaagtcat cagtgccgca ctgagtaatc agcaactgcg ccaagaattg 840
gaatctctca tttccgactc tttaaaagaa cctaatgagg tcggtgcttt tacgctgctg 900
cattttgtgg tgggtgcagt attaacagaa caaactcagc tgcagatgca cgagttcacg 960
gctggcgcgg aagatgacac acaagaaaac cctgccgatg cgaactttga ggagagattc 1020
aatcaaggat tagaaatcat tctggcgggt ctagacgcgc ttgggcatat aagatagcgt 1080
tct 1083


<210> 13
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 13
agtgacagct cgcgccgcat cgttgatggc aaacatgtcc aaattaaggc gcatgcgacc 60
aaggatggtg agcatttgcc ctgattcgtg gctgaagtac atcgaaatgc ggtgatcttg 120
ccacggcacg atgatgcgat cttctgagct caaatagtgg tagcccaagg aatcaacagc 180
ctccgtcact cgattcaggt caactgggaa aggaatggag gtgttgggtg cagggacgtt 240
tcgatcattc acagaggtga aatccattcc ttccagtatc tcaaaggtga aagcgggttt 300
aattcaggta aatctggggt ggtcatttta agttttaagt ctaattcaaa tgaactctga 360
tgtacccaaa tcagaaactt gttacgtggg gaatacaata ggtaaatatg cgggcttaag 420
aacttgtgtt gaggccgctt ggattcgggc accgagctcg aagaatttcg attcaacctt 480
ttaagggaga acttttcgcc 500


<210> 14
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 14
cgtggcgtgc cacccattta agtcccgcgg ggagactgaa gatggtggtg ccgtcggcga 60
ggcgttctgc ctgcgtaatg gggttaagtg ggatgaacgg gggagtcaga cgtgcgacag 120
cgcccttgcg ggtatgccaa tcccagacca tttctcgggg aaaaggaata aaatggcttg 180
tggtcagact cacaggggct tctccaagtc agtggattta tgaggtccca gtgggtacac 240
accgggtgtc ctacaacgat caattgtcac agattcgact ggcatgctgt accatctgct 300
ttaagcattt tggtgtttca ctgttgttaa cagtgtttca ccgtggagca ctaccttaga 360
tcatagtcag catcttgggg tgaatgtgac acggtacgct atagtgtcag acaacaacca 420
ggaaactggt cgttgcagag tttttgcaaa attggacatc ctttaacgga ccgcacagag 480
aggcggggaa ggaggtcacg 500

<210> 15
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 15
gccatggcgt tgcggaaatc gtaatcggcc attttgtcgg tggggaagag gtggatgtga 60
gtgtggggaa catcgaatcc tgcgatgatg taaccacatc gaggggcgtc gaatgctgtg 120
cggattgcat ttccgatgag ctgggaggcc tcgtttactt cgctccagat gttctgagga 180
aggtcggtcc agcggtcaac ttctgcaacg ggtacgacta gggtgtggcc gtaggtgagg 240
ggttcgatgg atagaaaagc cacgacattc tcggaacgat acacaaatcg gccggggagc 300
tcgccattaa taattttcgt gaatacagaa gccatatgca cagactacta cttggcgtgc 360
aaccaaattg aggttgcata aaataatgca ggtgagccgg tcattttaag gcgcttttcg 420
acgccacttt caaccatttc cgaaccgcca agaatactgg aatagcttgg atcaagtttt 480
gcaggataaa ctgtgcaacc 500



<210> 16 
<211> 500 
<212> DNA 

<213> Corynebacterium glutamicum 
<400> 16 

acgctgatcg ttttgagctt atcgacgctc gcctgcgctc agctggtttc gattggtacg 60 
aggtatccaa ctgggcgaaa cccggcggag aatgcaagca caacatgggc tattgggtcg 120 
acggcgactg gtggggtgct ggcccgggcg cgcactcgca catcggcgac cgccgcttct 180 
acaacatcaa gcacccagcg cgttactccg cgcagattgc ggccggcgag ctgcccatta 240 
aggaaacaga gcggctgacg gcggaagatc accacaccga gcgcgtcatg cttggtttgc 300 
gcctgaaaca aggcgtgccg ctgaaccttt tcgcacccgc agcgcgcccg gtcatcgacc 360
gtcatatcgc aggaggcctg ctgcacgtca atgcgctggg caacctggcg gtgaccgatg 420
cgggacgttt gcttgccgac ggcatcatcg ccgacatttt gcttagtgaa gaagactaaa 480
tatttagtag ggttacagac 500



<210> 17
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 17
tggcgactat gccacatagt cgactacctt gcatagttga ctatttgatt gagttgaact 60
gtgtcagtgt atgaagagaa aaatgaagag aaaaacagct gcacatggtt tcaaaagata 120
aggaagtgaa gcatgagcat cgagccagga atccccacgc ttggaccgct tgaagaacaa 180
gtcatgcaca ttctgtggga tcacggaaaa ttgacagtcc gtgaagtcat cgaattcctt 240
ccaggtgatc ctgcgtacac aacgatcgca accgtcctgc gtcacttggg cagaaaaggc 300
atggtcacca ttgtgaaaga tggtcggact gctcgacaca gcgcgttgat gaacagggaa 360
gaatacaccg ctggcgtcat ggatcaggtg ctgtcgacca gtcgggatcg caccgcatca 420
attctgcatt tcgtggatac gatcacggcg actgatcgcg agctgcttct ggagtatctg 480
caacagcagg agggcaggaa 500



<210> 18 

<211> 297 

<212> DNA 

<213> Corynebacterium glutamicum 



<400> 18 

agacttggcg tcaaagttga gtgggacctg ttgaaaacag ctcgtatctt ccctgattgg 60 

gtggttgaaa ttaggggtaa acccggattt tttctcaagt gaagcgcttt gacctgtgta 120 

aattaagaaa gttgagtcta gtgggaacaa ctttgtggca tttaccgttg ccatatatgt 180 

aagcttgagt caggcaggct caatgaggag tttttcttac cggcgaaagt cggtggaagc 240 

aagtcaaagc tcaagccgtg gacaatacta aaatcaccta aaacaggagg caccatt 297 



<210> 19
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 19
caoaaaccat tcttctccac gatggggcgc tcggtgaaat cactgaaacg cgcggtttcc 60
agaagcatca ccacgatggc gtcactgacc tcaaaagatg cagtgtgctc atcactgaag 120
acggtgtttt ctttgaagcc aagacctgca taaaaaacgc ttagatgctg caaggtcaga 180
tactggtagg ttgatgaaaa tcatgtcgtg ctgaagtcgt gccatgggag ttccacttcc 240
tcaaagtgcc ttttggctaa tgtgacccca accagtatgc tgccggtttt tggattaggt 300
ttggccatcg tgctagcata tttaggtctc ggcgagggtc aagtactttt agtgctcaac 360
cgttatcgac gcgatctaga cttctaaagt gcacttttgt gcgctgcctg cgaagactcg 420
accaagacat tcgagtcggt cgcgggcatt ttttattttc gcggccgagt gtccaccttc 480
atccatgagg agaaatcact 500



<210> 20
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 20
tggattgtct ggcggacaaa gttcgcaccg ataaacccgg caccaccggt cacaagcaaa 60
gaagtcatgg tcgtcatagt agttgcggct cctgcagttg cgcgtgttat cgctgacggg 120
catgtcaatt cagccgtgac ccagcttcgg tgcggtgtga aaacgtatgg ggaaaagcca 180
cggatagtgc atgcccactt ctccacatgg gctgtgctca gctgtcgtag gggccacggc 240
ttgcggactt tctgaacacc cggacgaaag cctccattat tgatggcacc cacccggggg 300
tggtatcggt tgctaagttc acagtggtcg gagagaataa gtcagcactt ctgattacat 360
gtacgaaatc agggcggtca aatgccagtt cccacccggc ttcactaagg cgggacgacc 420
atagaaacgc agcttctgac attgaatgcg ccggtttgct atggactaca ccgcatatga 480
ggaaagggct tgaaacgcac 500



<210> 21
<211> 500
<212> DNA
<213> Corynebacterium glutamicum

<400> 21
cttaatgagt acggcgtggc ttatatcggt ttcgctccgt ttgaaatcgg catgaccatc 60
atcgcgccaa tcgcggtgct cgcaggcttt actatggggt tgggttgggc gtcgttgatt 120
gttgcgatcg tgatttttgg cctcgcgtgg ggtctgaagt ggttgccgga gcgcggacat 180
gtccgcggcg agggtaagcc gcaataaagg ttggaagcgc cgggtctagg tccggcgctt 240
ctttcgtacg cttttcgacg cctccctcca cgtaatatta aagttacggg ttttccctga 300
tgcttaagtg gtagtcagtg cttaaacttg actgcggtcc actcaattta ttttcaaatt 360
ttttgaactt gagtggaaca tactcaactc tttgtgcgtt atagatatta gagagttaaa 420
taatggcgct tgacctgcag gaaattgaga tcaacactga ttgtgtaggt tggcgcccaa 480
caaagaaagg gcgttgaaag 500



<210> 22 

<211> 500 

<212> DNA 

<213> Corynebacterium glutamicum 



<400> 22 

tggccattcc tgtggtccat gctcgccctg ttcttcttca ctggactggg caacgccggc 60 

acattcaaac aaatgcccat gattttgccc aaacgccaag caggtggcgt gatcggctgg 120 

accggtgcca ttggtgcctt cggccccttc attgtcggtg tcttgctctc cttcactcca 180 

actgtcgcgt tcttctgggg ctgcgtggtg ttcttcatca tcgccaccgc tttgacctgg 240 

atctactacg cccgcccgaa cgctccattc ccgggataaa ccgaaaggcc aatccatgac 300 

tacaactact tcttctggga agtcttctga acagtcttct gaaaagatca accccctctt 360 

caagctcggc agtttcctaa gaaaaggcac cgtcggttct gaaggccagc agattttcct 420 

tcagggcgga cgccaagccg atgtgttttt atcgcaaccc gatgggcgtt cgataaaagt 480 

cgtgcgctcc cacacatggc 500 

<210> 23 

<211> 1285 

<212> DNA 

<213> Escherichia coli
<400> 23

tgcataatgt gcctgtcaaa tggacgaagc agggattctg caaaccctat gctactccgt 60 

caagccgtca attgtctgat tcgttaccaa ttatgacaac ttgacggcta catcattcac 120 

tttttcttca caaccggcac ggaactcgct cgggctggcc ccggtgcatt ttttaaatac 180 

ccgcgagaaa tagagttgat cgtcaaaacc aacattgcga ccgacggtgg cgataggcat 240

ccgggtggtg ctcaaaagca gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa 300 

gacgctaatc cctaactgct ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac 360 

atgctgtgcg acgctggcga tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg 420 

acaagcctcg cgtacccgat tatccatcgg tggatggagc gactcgttaa tcgcttccat 480

gcgccgcagt aacaattgct caagcagatt tatcgccagc agctccgaat agcgcccttc 540 

cccttgcccg gcgttaatga tttgcccaaa caggtcgctg aaatgcggct ggtgcgcttc 600 

atccgggcga aagaaccccg tattggcaaa tattgacggc cagttaagcc attcatgcca 660 

gtaggcgcgc ggacgaaagt aaacccactg gtgataccat tcgcgagcct ccggatgacg 720 

accgtagtga tgaatctctc ctggcgggaa cagcaaaata tcacccggtc ggcaaacaaa 780 

ttctcgtccc tgatttttca ccaccccctg accgcgaatg gtgagattga gaatataacc 840

tttcattccc agcggtcggt cgataaaaaa atcgagataa ccgttggcct caatcggcgt 900 

taaacccgcc accagatggg cattaaacga gtatcccggc agcaggggat cattttgcgc 960 

ttcagccata cttttcatac tcccgccatt cagagaagaa accaattgtc catattgcat 1020 

cagacattgc cgtcactgcg tcttttactg gctcttctcg ctaaccaaac cggtaacccc 1080 

gcttattaaa agcattctgt aacaaagcgg gaccaaagcc atgacaaaaa cgcgtaacaa 1140 

aagtgtctat aatcacggca gaaaagtcca cattgattat ttgcacggcg tcacactttg 1200 

ctatgccata gcatttttat ccataagatt agcggatcct acctgacgct ttttatcgca 1260 

actctctact gtttctccat acccg 1285 

<210> 24 
<211> 1285 
<212> DNA 

<213> Escherichia coli 



<220>
<221> CDS
<222> (266)..(1192)

<400> 24

cgggtatgga gaaacagtag agagttgcga taaaaagcgt caggtaggat ccgctaatct 60 

tatggataaa aatgctatgg catagcaaag tgtgacgccg tgcaaataat caatgtggac 120 

ttttctgccg tgattataga cacttttgtt acgcgttttt gtcatggctt tggtcccgct 180 

ttgttacaga atgcttttaa taagcggggt taccggtttg gttagcgaga agagccagta 240 

aaagacgcag tgacggcaat gtctg atg caa tat gga caa ttg gtt tct tct 292 
Met Gln Tyr Gly Gln Leu Val Ser Ser 
1 5 

ctg aat ggc ggg agt atg aaa agt atg gct gaa gcg caa aat gat ccc 340 
Leu Asn Gly Gly Ser Met Lys Ser Met Ala Glu Ala Gln Asn Asp Pro 
10 15 20 25 

ctg ctg ccg gga tac tcg ttt aat gcc cat ctg gtg gcg ggt tta acg 388 
Leu Leu Pro Gly Tyr Ser Phe Asn Ala His Leu Val Ala Gly Leu Thr 
30 35 40 

ccg att gag gcc aac ggt tat ctc gat ttt ttt atc gac cga ccg ctg 436 
Pro Ile Glu Ala Asn Gly Tyr Leu Asp Phe Phe Ile Asp Arg Pro Leu 
45 50 55 

gga atg aaa ggt tat att ctc aat ctc acc att cgc ggt cag ggg gtg 484 
Gly Met Lys Gly Tyr Ile Leu Asn Leu Thr Ile Arg Gly Gln Gly Val 
60 65 70 

gtg aaa aat cag gga cga gaa ttt gtt tgc cga ccg ggt gat att ttg 532 
Val Lys Asn Gln Gly Arg Glu Phe Val Cys Arg Pro Gly Asp Ile Leu 
75 80 85 

ctg ttc ccg cca gga gag att cat cac tac ggt cgt cat ccg gag gct 580 
Leu Phe Pro Pro Gly Glu Ile His His Tyr Gly Arg His Pro Glu Ala 
90 95 100 105 

cgc gaa tgg tat cac cag tgg gtt tac ttt cgt ccg cgc gcc tac tgg 628 
Arg Glu Trp Tyr His Gln Trp Val Tyr Phe Arg Pro Arg Ala Tyr Trp 
110 115 120 

cat gaa tgg ctt aac tgg ccg tca ata ttt gcc aat acg ggg ttc ttt 676 
His Glu Trp Leu Asn Trp Pro Ser Ile Phe Ala Asn Thr Gly Phe Phe 
125 130 135 

cgc ccg gat gaa gcg cac cag ccg cat ttc agc gac ctg ttt ggg caa 724 
Arg Pro Asp Glu Ala His Gln Pro His Phe Ser Asp Leu Phe Gly Gln 
140 145 150 

atc att aac gcc ggg caa ggg gaa ggg cgc tat tcg gag ctg ctg gcg 772 
Ile Ile Asn Ala Gly Gln Gly Glu Gly Arg Tyr Ser Glu Leu Leu Ala 
155 160 165 

ata aat ctg ctt gag caa ttg tta ctg cgg cgc atg gaa gcg att aac 820 
Ile Asn Leu Leu Glu Gln Leu Leu Leu Arg Arg Met Glu Ala Ile Asn 
170 175 180 185 

gag tcg ctc cat cca ccg atg gat aat cgg gta cgc gag gct tgt cag 868 
Glu Ser Leu His Pro Pro Met Asp Asn Arg Val Arg Glu Ala Cys Gln 
190 195 200 

tac atc agc gat cac ctg gca gac agc aat ttt gat atc gcc agc gtc 916 
Tyr Ile Ser Asp His Leu Ala Asp Ser Asn Phe Asp Ile Ala Ser Val 
205 210 215 

gca cag cat gtt tgc ttg tcg ccg tcg cgt ctg tca cat ctt ttc cgc 964 
Ala Gln His Val Cys Leu Ser Pro Ser Arg Leu Ser His Leu Phe Arg 
220 225 230 

cag cag tta ggg att agc gtc tta agc tgg cgc gag gac caa cgt atc 1012 
Gln Gln Leu Gly Ile Ser Val Leu Ser Trp Arg Glu Asp Gln Arg Ile 
235 240 245 

agc cag gcg aag ctg ctt ttg agc acc acc cgg atg cct atc gcc acc 1060 
Ser Gln Ala Lys Leu Leu Leu Ser Thr Thr Arg Met Pro Ile Ala Thr 
250 255 260 265 

gtc ggt cgc aat gtt ggt ttt gac gat caa ctc tat ttc tcg cgg gta 1108 
Val Gly Arg Asn Val Gly Phe Asp Asp Gln Leu Tyr Phe Ser Arg Val 
270 275 280 

ttt aaa aaa tgc acc ggg gcc agc ccg agc gag ttc cgt gcc ggt tgt 1156 
Phe Lys Lys Cys Thr Gly Ala Ser Pro Ser Glu Phe Arg Ala Gly Cys 
285 290 295 

gaa gaa aaa gtg aat gat gta gcc gtc aag ttg tca taattggtaa 1202 
Glu Glu Lys Val Asn Asp Val Ala Val Lys Leu Ser 
300 305 

cgaatcagac aattgacggc ttgacggagt agcatagggt ttgcagaatc cctgcttcgt 1262 

ccatttgaca ggcacattat gca 1285 
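
The CDS feature given for SEQ ID NO: 24 can be checked with a short sketch. The example below is illustrative only and is not part of the listing as filed. It confirms that the interval (266)..(1192) spans a whole number of codons equal to the length stated for SEQ ID NO: 25, and it translates the first nine codons of the coding region. The helper names and the deliberately partial codon table are assumptions introduced for illustration.

```python
# Illustrative check of the CDS coordinates of SEQ ID NO: 24 (not part of the listing).
CDS_START, CDS_END = 266, 1192  # 1-based, inclusive, taken from the <222> field

# Partial codon table: only the codons needed for the first nine residues.
CODON_TABLE = {
    "atg": "Met", "caa": "Gln", "tat": "Tyr", "gga": "Gly",
    "ttg": "Leu", "gtt": "Val", "tct": "Ser",
}


def codon_count(start: int, end: int) -> int:
    """Number of whole codons in a 1-based inclusive interval."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


def translate(dna: str) -> list[str]:
    """Translate a DNA string codon by codon using CODON_TABLE."""
    usable = len(dna) - len(dna) % 3
    return [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]


# First nine codons of the CDS (positions 266-292 of SEQ ID NO: 24).
first_codons = "atgcaatatggacaattggtttcttct"

print(codon_count(CDS_START, CDS_END))
print(" ".join(translate(first_codons)))
```

A full check would use a complete codon table and the entire 1285-nucleotide sequence; the excerpt is kept short so that it can be compared by eye with the first translated line above.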

<210> 25 

<211> 309 

<212> PRT 

<213> Escherichia coli 



<400> 25 



Met Gln Tyr Gly Gln Leu Val Ser Ser Leu Asn Gly Gly Ser Met Lys 
1 5 10 15 

Ser Met Ala Glu Ala Gln Asn Asp Pro Leu Leu Pro Gly Tyr Ser Phe 
20 25 30 

Asn Ala His Leu Val Ala Gly Leu Thr Pro Ile Glu Ala Asn Gly Tyr 
35 40 45 

Leu Asp Phe Phe Ile Asp Arg Pro Leu Gly Met Lys Gly Tyr Ile Leu 
50 55 60 

Asn Leu Thr Ile Arg Gly Gln Gly Val Val Lys Asn Gln Gly Arg Glu 
65 70 75 80 

Phe Val Cys Arg Pro Gly Asp Ile Leu Leu Phe Pro Pro Gly Glu Ile 
85 90 95 

His His Tyr Gly Arg His Pro Glu Ala Arg Glu Trp Tyr His Gln Trp 
100 105 110 

Val Tyr Phe Arg Pro Arg Ala Tyr Trp His Glu Trp Leu Asn Trp Pro 
115 120 125 

Ser Ile Phe Ala Asn Thr Gly Phe Phe Arg Pro Asp Glu Ala His Gln 
130 135 140 

Pro His Phe Ser Asp Leu Phe Gly Gln Ile Ile Asn Ala Gly Gln Gly 
145 150 155 160 

Glu Gly Arg Tyr Ser Glu Leu Leu Ala Ile Asn Leu Leu Glu Gln Leu 
165 170 175 

Leu Leu Arg Arg Met Glu Ala Ile Asn Glu Ser Leu His Pro Pro Met 
180 185 190 

Asp Asn Arg Val Arg Glu Ala Cys Gln Tyr Ile Ser Asp His Leu Ala 
195 200 205 

Asp Ser Asn Phe Asp Ile Ala Ser Val Ala Gln His Val Cys Leu Ser 
210 215 220 

Pro Ser Arg Leu Ser His Leu Phe Arg Gln Gln Leu Gly Ile Ser Val 
225 230 235 240 

Leu Ser Trp Arg Glu Asp Gln Arg Ile Ser Gln Ala Lys Leu Leu Leu 
245 250 255 

Ser Thr Thr Arg Met Pro Ile Ala Thr Val Gly Arg Asn Val Gly Phe 
260 265 270 

Asp Asp Gln Leu Tyr Phe Ser Arg Val Phe Lys Lys Cys Thr Gly Ala 
275 280 285 

Ser Pro Ser Glu Phe Arg Ala Gly Cys Glu Glu Lys Val Asn Asp Val 
290 295 300 

Ala Val Lys Leu Ser 
305 

<210> 26 
<211> 56 
<212> DNA 
<213> Synthetic 

<220> 

<223> lacIq-trc promoter 
<400> 26 

ttgacaatta atcatccggc tcgtataatg tgtggaattg tgagcggata acaatt 56 

<210> 27 

<211> 55 

<212> DNA 

<213> Synthetic 

<220> 

<223> lacIq-tac promoter 
<400> 27 

ttgacaatta atcatcggct cgtataatgt gtggaattgt gagcggataa caatt 55 
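
The relationship between SEQ ID NO: 26 and SEQ ID NO: 27 can be made explicit with a short sketch. The example below is illustrative only and is not part of the listing as filed. It treats `ttgaca` and `tataat` as the -35 and -10 hexamers, which is an assumption based on the E. coli consensus rather than an annotation in the listing, and measures the spacer between them. The function name `spacer_length` is hypothetical.

```python
# Illustrative comparison of SEQ ID NO: 26 and SEQ ID NO: 27 (not part of the listing).
SEQ26 = "ttgacaattaatcatccggctcgtataatgtgtggaattgtgagcggataacaatt"  # trc promoter
SEQ27 = "ttgacaattaatcatcggctcgtataatgtgtggaattgtgagcggataacaatt"   # tac promoter


def spacer_length(seq: str, minus35: str = "ttgaca", minus10: str = "tataat") -> int:
    """Bases between the end of the assumed -35 hexamer and the start of the assumed -10 hexamer."""
    end35 = seq.index(minus35) + len(minus35)
    start10 = seq.index(minus10, end35)
    return start10 - end35


for name, seq in (("SEQ ID NO: 26", SEQ26), ("SEQ ID NO: 27", SEQ27)):
    print(name, "length:", len(seq), "spacer:", spacer_length(seq))
```

Under these assumptions the two sequences differ by a single base in the spacer region, which accounts for the one-nucleotide difference between the lengths given in the <211> fields.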

<210> 28 

<211> 1092 

<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)..(1089) 

<400> 28 

gtg gtg aat gtg aaa cca gta acg tta tac gat gtc gca gag tat gcc 48 
Val Val Asn Val Lys Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala 
1 5 10 15 

ggt gtc tct tat cag acc gtt tcc cgc gtg gtg aac cag gcc agc cac 96 
Gly Val Ser Tyr Gln Thr Val Ser Arg Val Val Asn Gln Ala Ser His 
20 25 30 

gtt tct gcg aaa acg cgg gaa aaa gtg gaa gcg gcg atg gcg gag ctg 144 
Val Ser Ala Lys Thr Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu 
35 40 45 

aat tac att ccc aac cgc gtg gca caa caa ctg gcg ggc aaa cag tcg 192 
Asn Tyr Ile Pro Asn Arg Val Ala Gln Gln Leu Ala Gly Lys Gln Ser 
50 55 60 

ttg ctg att ggc gtt gcc acc tcc agt ctg gcc ctg cac gcg ccg tcg 240 
Leu Leu Ile Gly Val Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser 
65 70 75 80 

caa att gtc gcg gcg att aaa tct cgc gcc gat caa ctg ggt gcc agc 288 
Gln Ile Val Ala Ala Ile Lys Ser Arg Ala Asp Gln Leu Gly Ala Ser 
85 90 95 

gtg gtg gtg tcg atg gta gaa cga agc ggc gtc gaa gcc tgt aaa gcg 336 
Val Val Val Ser Met Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala 
100 105 110 

gcg gtg cac aat ctt ctc gcg caa cgc gtc agt ggg ctg atc att aac 384 
Ala Val His Asn Leu Leu Ala Gln Arg Val Ser Gly Leu Ile Ile Asn 
115 120 125 

tat ccg ctg gat gac cag gat gcc att gct gtg gaa gct gcc tgc act 432 
Tyr Pro Leu Asp Asp Gln Asp Ala Ile Ala Val Glu Ala Ala Cys Thr 
130 135 140 

aat gtt ccg gcg tta ttt ctt gat gtc tct gac cag aca ccc atc aac 480 
Asn Val Pro Ala Leu Phe Leu Asp Val Ser Asp Gln Thr Pro Ile Asn 
145 150 155 160 

agt att att ttc tcc cat gaa gac ggt acg cga ctg ggc gtg gag cat 528 
Ser Ile Ile Phe Ser His Glu Asp Gly Thr Arg Leu Gly Val Glu His 
165 170 175 

ctg gtc gca ttg ggt cac cag caa atc gcg ctg tta gcg ggc cca tta 576 
Leu Val Ala Leu Gly His Gln Gln Ile Ala Leu Leu Ala Gly Pro Leu 
180 185 190 

agt tct gtc tcg gcg cgt ctg cgt ctg gct ggc tgg cat aaa tat ctc 624 
Ser Ser Val Ser Ala Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu 
195 200 205 

act cgc aat caa att cag ccg ata gcg gaa cgg gaa ggc gac tgg agt 672 
Thr Arg Asn Gln Ile Gln Pro Ile Ala Glu Arg Glu Gly Asp Trp Ser 
210 215 220 

gcc atg tcc ggt ttt caa caa acc atg caa atg ctg aat gag ggc atc 720 
Ala Met Ser Gly Phe Gln Gln Thr Met Gln Met Leu Asn Glu Gly Ile 
225 230 235 240 

gtt ccc act gcg atg ctg gtt gcc aac gat cag atg gcg ctg ggc gca 768 
Val Pro Thr Ala Met Leu Val Ala Asn Asp Gln Met Ala Leu Gly Ala 
245 250 255 

atg cgc gcc att acc gag tcc ggg ctg cgc gtt ggt gcg gat atc tcg 816 
Met Arg Ala Ile Thr Glu Ser Gly Leu Arg Val Gly Ala Asp Ile Ser 
260 265 270 

gta gtg gga tac gac gat acc gaa gac agc tca tgt tat atc ccg ccg 864 
Val Val Gly Tyr Asp Asp Thr Glu Asp Ser Ser Cys Tyr Ile Pro Pro 
275 280 285 

tca acc acc atc aaa cag gat ttt cgc ctg ctg ggg caa acc agc gtg 912 
Ser Thr Thr Ile Lys Gln Asp Phe Arg Leu Leu Gly Gln Thr Ser Val 
290 295 300 

gac cgc ttg ctg caa ctc tct cag ggc cag gcg gtg aag ggc aat cag 960 
Asp Arg Leu Leu Gln Leu Ser Gln Gly Gln Ala Val Lys Gly Asn Gln 
305 310 315 320 

ctg ttg ccc gtc tca ctg gtg aaa aga aaa acc acc ctg gcg ccc aat 1008 
Leu Leu Pro Val Ser Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn 
325 330 335 

acg caa acc gcc tct ccc cgc gcg ttg gcc gat tca tta atg cag ctg 1056 
Thr Gln Thr Ala Ser Pro Arg Ala Leu Ala Asp Ser Leu Met Gln Leu 
340 345 350 

gca cga cag gtt tcc cga ctg gaa agc ggg cag tga 1092 
Ala Arg Gln Val Ser Arg Leu Glu Ser Gly Gln 
355 360 

<210> 29 
<211> 363 
<212> PRT 
<213> Escherichia coli 

<400> 29 

Val Val Asn Val Lys Pro Val Thr Leu Tyr Asp Val Ala Glu Tyr Ala 
1 5 10 15 

Gly Val Ser Tyr Gln Thr Val Ser Arg Val Val Asn Gln Ala Ser His 
20 25 30 

Val Ser Ala Lys Thr Arg Glu Lys Val Glu Ala Ala Met Ala Glu Leu 
35 40 45 

Asn Tyr Ile Pro Asn Arg Val Ala Gln Gln Leu Ala Gly Lys Gln Ser 
50 55 60 

Leu Leu Ile Gly Val Ala Thr Ser Ser Leu Ala Leu His Ala Pro Ser 
65 70 75 80 

Gln Ile Val Ala Ala Ile Lys Ser Arg Ala Asp Gln Leu Gly Ala Ser 
85 90 95 

Val Val Val Ser Met Val Glu Arg Ser Gly Val Glu Ala Cys Lys Ala 
100 105 110 

Ala Val His Asn Leu Leu Ala Gln Arg Val Ser Gly Leu Ile Ile Asn 
115 120 125 

Tyr Pro Leu Asp Asp Gln Asp Ala Ile Ala Val Glu Ala Ala Cys Thr 
130 135 140 

Asn Val Pro Ala Leu Phe Leu Asp Val Ser Asp Gln Thr Pro Ile Asn 
145 150 155 160 

Ser Ile Ile Phe Ser His Glu Asp Gly Thr Arg Leu Gly Val Glu His 
165 170 175 

Leu Val Ala Leu Gly His Gln Gln Ile Ala Leu Leu Ala Gly Pro Leu 
180 185 190 

Ser Ser Val Ser Ala Arg Leu Arg Leu Ala Gly Trp His Lys Tyr Leu 
195 200 205 

Thr Arg Asn Gln Ile Gln Pro Ile Ala Glu Arg Glu Gly Asp Trp Ser 
210 215 220 

Ala Met Ser Gly Phe Gln Gln Thr Met Gln Met Leu Asn Glu Gly Ile 
225 230 235 240 

Val Pro Thr Ala Met Leu Val Ala Asn Asp Gln Met Ala Leu Gly Ala 
245 250 255 

Met Arg Ala Ile Thr Glu Ser Gly Leu Arg Val Gly Ala Asp Ile Ser 
260 265 270 

Val Val Gly Tyr Asp Asp Thr Glu Asp Ser Ser Cys Tyr Ile Pro Pro 
275 280 285 

Ser Thr Thr Ile Lys Gln Asp Phe Arg Leu Leu Gly Gln Thr Ser Val 
290 295 300 

Asp Arg Leu Leu Gln Leu Ser Gln Gly Gln Ala Val Lys Gly Asn Gln 
305 310 315 320 

Leu Leu Pro Val Ser Leu Val Lys Arg Lys Thr Thr Leu Ala Pro Asn 
325 330 335 

Thr Gln Thr Ala Ser Pro Arg Ala Leu Ala Asp Ser Leu Met Gln Leu 
340 345 350 

Ala Arg Gln Val Ser Arg Leu Glu Ser Gly Gln 
355 360 

<210> 30 

<211> 1511 

<212> DNA 

<213> Escherichia coli 



<400> 30 
catgcattta cgttgacacc atcgaatggt gcaaaacctt tcgcggtatg gcatgatagc 60 

gcccggaaga gagtcaattc agggtggtga atgtgaaacc agtaacgtta tacgatgtcg 120 

cagagtatgc cggtgtctct tatcagaccg tttcccgcgt ggtgaaccag gccagccacg 180 

tttctgcgaa aacgcgggaa aaagtggaag cggcgatggc ggagctgaat tacattccca 240 

accgcgtggc acaacaactg gcgggcaaac agtcgttgct gattggcgtt gccacctcca 300 

gtctggccct gcacgcgccg tcgcaaattg tcgcggcgat taaatctcgc gccgatcaac 360 

tgggtgccag cgtggtggtg tcgatggtag aacgaagcgg cgtcgaagcc tgtaaagcgg 420 

cggtgcacaa tcttctcgcg caacgcgtca gtgggctgat cattaactat ccgctggatg 480 

accaggatgc cattgctgtg gaagctgcct gcactaatgt tccggcgtta tttcttgatg 540 

tctctgacca gacacccatc aacagtatta ttttctccca tgaagacggt acgcgactgg 600 

gcgtggagca tctggtcgca ttgggtcacc agcaaatcgc gctgttagcg ggcccattaa 660 

gttctgtctc ggcgcgtctg cgtctggctg gctggcataa atatctcact cgcaatcaaa 720 

ttcagccgat agcggaacgg gaaggcgact ggagtgccat gtccggtttt caacaaacca 780 

tgcaaatgct gaatgagggc atcgttccca ctgcgatgct ggttgccaac gatcagatgg 840 

cgctgggcgc aatgcgcgcc attaccgagt ccgggctgcg cgttggtgcg gatatctcgg 900 

tagtgggata cgacgatacc gaagacagct catgttatat cccgccgtca accaccatca 960 

aacaggattt tcgcctgctg gggcaaacca gcgtggaccg cttgctgcaa ctctctcagg 1020 

gccaggcggt gaagggcaat cagctgttgc ccgtctcact ggtgaaaaga aaaaccaccc 1080 

tggcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg 1140 

cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag 1200 

cgcgaattga tctggtttga cagcttatca tcgactgcac ggtgcaccaa tgcttctggc 1260 

gtcaggcagc catcggaagc tgtggtatgg ctgtgcaggt cgtaaatcac tgcataattc 1320 

gtgtcgctca aggcgcactc ccgttctgga taatgttttt tgcgccgaca tcataacggt 1380 

tctggcaaat attctgaaat gagctgttga caattaatca tccggctcgt ataatgtgtg 1440 

gaattgtgag cggataacaa tttcacacag gaaacagacc atggaattcg agctcggtac 1500 

ccggggatcc t 1511 

<210> 31 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Oligonucleotide forward primer 

<400> 31 

gtgcggatcc tggctttcca acgttt 26 

<210> 32 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Oligonucleotide reverse primer 

<400> 32 

catgggctcg agatgacctg tgccta 26 
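
The primers of SEQ ID NO: 31 and SEQ ID NO: 32 can be scanned for restriction enzyme recognition sequences with a short sketch. The example below is illustrative only and is not part of the listing as filed. The recognition sequences for BamHI (GGATCC) and XhoI (CTCGAG) are standard; that these sites were intended as cloning sites is an assumption, since the listing describes the sequences only as forward and reverse primers. The function name `find_sites` is hypothetical.

```python
# Illustrative scan of the primers of SEQ ID NO: 31 and SEQ ID NO: 32 (not part of the listing).
FORWARD = "gtgcggatcctggctttccaacgttt"  # SEQ ID NO: 31
REVERSE = "catgggctcgagatgacctgtgccta"  # SEQ ID NO: 32

# Standard recognition sequences; their intended role in these primers is an assumption.
SITES = {"BamHI": "ggatcc", "XhoI": "ctcgag"}


def find_sites(primer: str) -> dict[str, int]:
    """Return the 1-based start position of each recognition sequence present in the primer."""
    return {
        name: primer.index(site) + 1
        for name, site in SITES.items()
        if site in primer
    }


print("forward:", find_sites(FORWARD))
print("reverse:", find_sites(REVERSE))
```

Both recognition sequences are palindromic, so scanning one strand is sufficient for this purpose.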

<210> 33 

<211> 544 

<212> DNA 

<213> Corynebacterium glutamicum 



<400> 33 

ctgggaaggg taccgcctaa tcctcaccgc attacttgga acaatcgtgc agcagtattc 60 

ctttaacgct ggcgaactac aaaaatcgct acccgccatg accattgccg aaccaattgt 120 

cgccttcagt ttgggctact tggttctggg cgaaaaattc caagtcgtgg actgggaatg 180 

gatcgccatg ggcatcgcac tcttggtgat gattgtttcc accattgcac tgtctcgtac 240 

aagcacaatg ccggccggat cgaaaaggta aaactccaaa gttccccccc cgagacatga 300 

cagcactgga actgggcgtc gaaaagcttt tttaaaagaa aactcccccc gagttgctac 360 

ccacaccaca aagttgttgt atgcttcacc acatgacttc gcgtgcgaat ctacttcttc 420 

ttcgccgcgg cgggtcccag aggtcttaac acgaccggca tcccgtcgcg gagtttggtg 480 

ttgccggtcg tggacccacc caaaactttt taagaaggtt ctagacaatg tcttctaacg 540 

atgc 544 

<210> 34 

<211> 500 

<212> DNA 

<213> Corynebacterium glutamicum 

<400> 34 

gaaccaatca ctggtacctc ggacaaccga tcgatcagtg cttgctgctc ttcgctgatt 60 

tcatcgtcat catcgctggt gtctttgccg gaggagctca agctcggggc cgcaggaatg 120 

tcgccactgc tgagcattga gctgccttca gagctgcctg gccaggtttc gtttccatcg 180 

actggttttc catcatcatc aaggatctgt gatgaggtga tgttgtctga gagctgtgtc 240 

agtgcgtcag aggactgagc ctgggcaact ggagtgaaca cggacaatgc cacagcgctt 300 

gctgtaacaa gggtcaaagt acttcgacgc aaagacaaaa cttttctcct ggcaataaat 360 

atgcggattt actatggaaa caagatagaa gattggatag cgaaagctat cctcaactcg 420 

tggaaagtgt aatgccacaa ccacagtatt ggctagaaaa caatctatag cattgttcta 480 

caaagagctc gttggaaata 500 

<210> 35 

<211> 500 

<212> DNA 

<213> Corynebacterium glutamicum 

<400> 35 

ctacatcaaa accggtaccg acaatccaat tccacaatga atagagcaaa tattgaatgg 60 

gtacgcctaa aatcatgagc caagattagc gctgaaaagt agcgggagcc tgcctgaact 120 

ttgtgagaat cctgattcct taaccgaagt gggggagttt tgggggtggg aattttcgtg 180 

cgttgtggaa ttggaaactc gatgtgtgta gcatgacaca ccatgaccat tattcgactt 240 

gtagtagtaa ccgcgcggcg cctgccgtaa cggccttcca agtcgtctcg tcaagcgccc 300 

tcgacaacac tcaccacagt gttggaacga gggctttctt gttggttatg acccaagtag 360 

ccaactttgc aacagacatc tgtcgcactg cgtgcacacg catccgcgtc ggaacaattt 420 

taaatgaggg ctttgtcttt aggctgagtt gaaatcggct tggcttggac gggtcctgtg 480 

aaaatcctta tttagtaaag 500 

an extent that no meaningful International Search can be carried out, specifically: 



3. [ ] Claims Nos.: 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 

1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all searchable claims. 

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of any additional fee. 

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report covers only those claims for which fees were paid, specifically claims Nos.: 

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-38 partially 

FURTHER INFORMATION CONTINUED FROM PCT/ISA/210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

Invention 1: Claims (1-38) - partially 

Isolated Corynebacterium glutamicum promoter comprising SEQ 
ID NO: 4, homologs and fragments; fusions, vectors containing 
them, host cells carrying the vectors and use of said cells 
for production of biosynthetic products. 



Inventions 2-19: Claims (1-38) - partially 

Idem as invention 1, but restricted to SEQ ID NOs: 5-22 
respectively. 

Invention 20: Claims (39-45) - completely 

Isolated Corynebacterium species chromosome comprising: a 
first nucleic acid integrated into said chromosome, the 
sequence of which is at least 90% identical to SEQ ID NO: 1, 
and a second nucleic acid integrated into said chromosome, 
wherein said second nucleic acid encodes a polypeptide which 
functions as a component of an amino acid biosynthesis 
pathway; and wherein said first nucleic acid regulates 
transcription of said second nucleic acid. Host cell 
comprising said chromosome and use thereof to produce an 
amino acid. Vector comprising the first and second 
polynucleotides mentioned above. 



Invention 21: Claims (46-51) - completely 

Isolated polynucleotide comprising: a first nucleic acid, 
the sequence of which is at least 90% identical to SEQ ID 
NO: 2, and a second nucleic acid operably associated with 
said first nucleic acid, wherein said second nucleic acid 
encodes a polypeptide which functions as a component of a 
lysine biosynthesis pathway; wherein said first nucleic acid 
regulates transcription of said second nucleic acid. Vector 
comprising the polynucleotide and host cell transformed with 
it. 



Invention 22: Claims (52-58) - completely 

Isolated polynucleotide comprising: a first nucleic acid, 
the sequence of which is at least 90% identical to SEQ ID 
NO: 3, and a second nucleic acid operably associated with 
said first nucleic acid, wherein said second nucleic acid 
encodes a polypeptide which functions as a component of an 
amino acid biosynthesis pathway; wherein said first nucleic 
acid regulates transcription of said second nucleic acid. 
Vector comprising the polynucleotide, host cell transformed 
with it and use of said transformant to produce an amino 
acid. 

Invention 23: Claims (59-66) - completely 

Isolated Corynebacterium species chromosome comprising: a 
first nucleic acid integrated into said chromosome, the 
sequence of which is at least 90% identical to SEQ ID NO: 23, 
and a second nucleic acid integrated into said chromosome in 
operable association with said first nucleic acid, wherein 
said second nucleic acid encodes a polypeptide which 
functions as a component of an amino acid biosynthesis 
pathway; and wherein said first nucleic acid regulates 
transcription of said second nucleic acid. Host cell 
comprising said chromosome and use thereof to produce an 
amino acid. Vector comprising the first and second 
polynucleotides mentioned above. 



Invention 24: Claims (67-73) - completely 

Isolated Corynebacterium species chromosome comprising: a 
first nucleic acid integrated into said chromosome, the 
sequence of which is identical to SEQ ID NO: 26 or to SEQ ID 
NO: 27, a second nucleic acid integrated into said chromosome 
in operable association with said first nucleic acid, 
wherein said second nucleic acid encodes a polypeptide which 
functions as a component of an amino acid biosynthesis 
pathway, and a third nucleic acid, operably associated to 
the first nucleic acid and the sequence of which is at least 
90% identical to SEQ ID NO: 28; wherein said first nucleic 
acid regulates transcription of said second nucleic acid, 
and wherein said third nucleic acid encodes a lac repressor 
polypeptide. Host cell comprising said chromosome and use 
thereof to produce an amino acid. Vector comprising the 
first, the second and the third polynucleotides mentioned 
above. 